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Chapter  1 


Introduction 


1.1  Background  and  Motivation 


The  Large  Space  Structure  (LSS)  research  program  was  originally  formulated  in  late  1982  in  re¬ 
sponse  to  the  increasing  concern  that  performance  robustness  of  Air  Force  LSS  type  system  would 
be  inadequate  to  meet  mission  objectives.  In  particular,  uncertainties  in  both  system  dynamics  and 
disturbance  spectra  characterizations  (both  time  varying  and  stochastic  uncertainty)  significantly 
limit  the  performance  attainable  with  fixed  gain,  fixed  architecture  controls.  Therefore,  the  use 
of  an  adaptive  system,  where  disturbances  and/or  plant  models  are  identified  prior  to  or  during 
control,  gives  systems  designers  more  options  for  minimizing  the  risk  in  achieving  performance 
objectives. 

-  The  aim  of  adaptive  control  is  to  implement  in  real-time  and  on-line  as  many  as  possible  of  the  - 
design  functions  now  performed  off-line  by  the  control  engineer;  to  give  the  controller  ‘'intelligence’^ 
To  realize  this  aim,  both  a  theory  of  stability  and  performance  of  such  inherently  nonlinear  controls 
is  essential  as  well  as  a  technology  capable  of  achieving  the  implementation.') 

The  issues  of  performance  sensitivity,  robustness,  and  achievement  of  very  high  performance 
in  an  LSS  system  can  be  effectively  addressed  using  adaptive  algorithms.  '^The  need  to  identify 
modal  frequencies,  for  example,  in  high-performance  disturbance  rejection  systems  has  been  shown 
in  ACOSS  (1981)  and  VCOSS  (1982).  The  deployment  of  high-performance  optical  or  RF  systems 
may  require  on-line  identification  of  critical  modal  parameters  before  full  control  authority  can  be 
exercised.  Parameter  sensitivity,  manifested  by  performance  degradation  or  loss  of  stability  (poor 
robustness)  may  be  effectively  reduced  by  adaptive  feedback  mechanizations.  Reducing  the  effects  of 
on-board  disturbance  rejection)  is  particularly  important  for  planned  Air  Force  missions.  For  these 
cases,  adaptive  control  mechanizations  are  needed  to  produce  the  three-to-five  orders-of-magnitude 
reductions  in  line-of-sight  jitter  required  by  the  mission. 

Research  is  essential  to  identify  the  performance  limitations  of  adaptive  strategies  for  LSS  con¬ 
trol  both  from  theoretical  and  hardware  mechanization  viewpoints.  The  long  range  goal  of  this 
research  program  is  to  establish  guidelines  for  selecting  the  appropriate  strategy,  to  evaluate  per¬ 
formance  improvements  over  fixed-gain  mechanizations,  and  to  examine  the  iii  <  liit^cture  necessary 
to  produce  a  practical  hardware  realization.  The  initial  thrust,  however,  is  i<>  imitinue  to  build  a 
strong  theoretical  foundation  without  losing  sight  of  the  practical  implemenfai  icm  issues. 
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1.2  Research  Objectives 


The  aims  of  this  research  study  are  to  extend  and  develop  adaptive  control  theory  and  its  application 
to  LSS  in  several  directions.  These  include: 

1.  Theoretical  Development:  The  initial  emphasis  has  been  on  slow  adaptation,  since  this 
covers  may  LSS  situations.  Later  on  we  will  examine  fast  adaptation.  The  theory  developed 
here  will  provide  for: 

(a)  estimates  of  robustness,  i.e.,  stability  margins  vs.  performance  bounds; 

(b)  estimates  of  regions  of  attraction  and  rates  of  parameter  convergence  to  these  regions; 

(c)  extension  of  the  present  linear  finite  dimensional  adaptive  theory  to  include  nonbnear 
and  infinite  dimensional  plants  and  controller  structures;  and 

(d)  extensions  to  decentralized  systems. 

2.  Parameter  Adaptive  Algorithms  Assess  the  behavior  of  different  algorithms,  including: 
gradient,  recursive  least  squares,  normalized  least  mean  squares,  and  nonlinear  observer  (e.g. 
Extended  Kalman  Filter). 

3.  Parametric  Models:  Assess  the  impact  of  model  choices.  In  particular  we  will  examine 
the  effect  of  explicit  and  implicit  model  choices.  An  explicit  model,  for  example,  is  a  transfer 
function  whose  coefficients  are  all  unknown.  In  an  implicit  model  transfer  function,  the 
coefficients  would  be  functions  of  some  other  parameters.  Implicit  models  usually  arise  from 
physical  or  experimental  data,  whereas  explicit  models  are  selected  for  analytical  convenience. 

4.  Adaptive  Nonlinear  Control:  Although  our  early  effort  is  to  study  adaptive  linear  control, 
there  are  may  LSS  situations  where  the  control  is  nonlinear,  e.g.,  large  angle  maneuvers, 
slewing. 

1.3  Current  Status 

At  the  present  time  we  stand  at  the  beginning  stages  of  the  theoretical  development  in  adaptive 
control.  The  result  of  recent  efforts  are  contained  in  the  selected  papers  in  the  Appendix  and  the 
references  therein.  A  summary  of  earlier  efforts  is  contained  in  the  recently  published  textbook 
Stability  of  Adaptive  Systems:  PiLssivity  and  Averaging  Analysis,  MIT  Press,  1986.  This  publica¬ 
tion  is  an  outgrowth  of  research  supported  under  this  contract  and  involved  a  considerable  amount 
of  collaborative  effort  among  several  researchers  in  the  field  of  adaptive  control.  The  text  discusses 
adaptive  systems  from  the  viewpoint  of  stability  theory.  The  emphasis  is  on  methodology  and  basic 
concepts,  rather  than  on  details  of  adaptive  algorithm.  The  analysis  reveals  common  properties 
including  causes  and  mechanisms  for  instability  and  the  means  to  counteract  them.  Conditions 
for  stability  are  presented  under  slow  adaptation,  where  the  method  of  averaging  is  utilized.  In 
this  latter  case  the  stability  result  is  local,  i.e.,  the  initial  parametrization  and  input  spectrum  is 
constrained.  Based  on  this  analysis,  a  conceptual  fraunework  is  now  available  to  pursue  the  issues 
of  slow  adaptive  control  of  LSS. 

To  remove  the  restrictiveness  of  slow  adaptation  requires  an  understanding  of  the  transient 
behavior  of  adaptive  systems.  A  preliminary  investigation  is  reported  in  Kosut  et  al  (1986).  The 
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transient  behavior  of  not-slow  or  even  rapid  adaptation  is  a  significant  problem  in  the  adaptive 
control  of  LSS,  e.g.,  rapid  retargeting. 

Another  approach  to  adaptive  control  is  to  calibrate  (or  tune)  the  controller  based  on  a  current 
estimate  of  the  LSS  model.  This  involves  not  just  knowing  one  model,  but  rather,  a  model  set.  This 
problem,  which  we  refer  to  as  adaptive  calibration,  is  essentially  that  of  developing  a  technique  of 
on-line  robust  control  design  from  an  identified  model.  Although  we  have  worked  on  this  problem 
for  some  time  it  is  only  recently  that  we  have  established  a  theoretical  basis  for  estimating  model 
error  from  system  identification,  Kosut(1987,1988),  and  this  Appendix.  This  research  has  raised 
many  new  questions  which  need  to  be  considered,  e.g.,  what  is  the  appropriate  robust  controller 
parametrization;  how  does  it  relate  to  model  parametrization;  how  to  iterate  on  the  data  if  the 
estimate  of  model  error  is  too  large;  what  are  the  heuristics  for  experiment  design. 


1.4  Organization  of  Report 

Throughout  the  remainder  of  the  report  we  will  discuss  our  activities  this  past  year  in  uncertainty 
estimation  and  adaptive  nonlinear  control. 

The  Appendix  contains  a  preprint  of  an  extensive  journal  article  on  uncertainty  estimation,  in 
particular,  on  the  subject  of  parameter  set  estimation.  There  still  remains  much  to  be  done  in  this 
area,  particularly  in  regard  to  control  design  using  models  with  uncertain  parameters.  Another 
major  effort  in  this  area  is  in  the  use  of  fractional  representations  to  ciid  in  system  identification 
in  closed-loop.  As  the  work  is  stiU  in  progress,  we  postpone  discussion  of  this  subject  for  the  next 
report,  although  a  conference  paper  will  be  presented  in  1989  (see  below  ).  There  are  now  many 
researchers  interested  in  this  approach,  and  two  invited  sessions  have  been  organized  for  the  1989 
ACC. 

In  Chapter  2  we  discuss  a  new  research  direction,  namely,  the  adaptive  control  of  nonlinear 
flexible  systems.  Here  we  encounter  unique  nonlinear  phenomena  such  as  limit  cycles,  bifurcations, 
and  chaos.  An  e-xample  is  presented  which  demonstrates  adaptation  of  a  chaotic  system.  This  area 
is  still  very  new,  thus,  much  of  our  discussion  is  in  the  future  tense,  that  is,  what  we  hope  can  be 
done  in  future  research. 


1.5  Selected  Publications  to  Date 

1.5.1  Journals  and  Conferences 

R.L.  Kosut,  “Adaptive  Control  via  Parameter  Set  Estimation”,  to  appear.  Ini.  J.  of  Adaptive 
Control  and  Signal  Processing,  1989. 

R.L.  Kosut,  “Adaptive  Control  of  a  Chaotic  System”,  to  appear,  Proc.  1988  ACC,  Pittsburg,  PA, 
June  1989. 

F.  Hanson,  R.L.  Kosut,  and  G.F.  Franklin,  “Closed-Loop  System  Identification  via  the  Fractional 
Representation:  Experiment  Design  and  Unmodeled  Dynamics”  ,  to  appear,  Proc.  1988 
ACC,  Pittsburg,  PA,  June  1989. 

R.L.  Kosut,  “On  The  Use  of  The  Method  of  Averaging  for  the  Stability  Analysis  of  Adaptive 
Linear  Control  Systems”,  Proc.  IEEE  CDC,  Los  Angeles,  CA,  Dec.  1987. 
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R.L.  Kosut,  “Conditions  for  Convergence  and  Divergence  of  Parameter  Adaptive  Linear  Systems”, 
Proc.  ISCAS  1987,  Philda.,  PA,  May,  1987. 

R.L.  Kosut,  “Adaptive  Calibration:  An  approach  to  Uncertainty  Modeling  and  On-Line  Robust 
Control  Design”,  Proc.  25th  IEEE  CDC,  Athens,  Greece,  Dec.  1986. 

R.L.  Kosut,  “Towards  an  On-Line  Procedure  for  Automated  Robust  Control  Design:  The  Adaptive 
Calibration  Problem”,  presented  at  1986  ACC,  June  1986. 

R.L.  Kosut, ^  I.M.Y.  Mareels,  B.D.O.  Anderson,  R.R.  Bitmead,  and  C.R.  Johnson,  Jr.,  “Transient 
Analysis  of  Adaptive  Control”,  submitted  to  IFAC  10th  World  Congress,  Munich,  Germany, 
July  1987. 

R.L.  Kosut,^  and  R.R.  Bitmead,  “FLxed-Point  Theorems  for  Stability  Analysis  of  Adaptive  Sys¬ 
tems”,  Proc.  IFAC  Workshop  on  Adaptive  Systems,  Lund,  Sweden,  July  1986. 

R.L.  Kosut,*  and  R.R.  Bitmead,  “Linearization  of  Adaptive  Systems:  A  FLxed-Point  Analy¬ 
sis”,  submitted,  IEEE  Trans,  on  Circuits  and  Systems;  Special  Issue  on  Adaptive  Systems, 
Sept.  1987. 

I.M.Y.  Mareels,  R.R.  Bitmead,  M.  Gevers,  C.R.  Johnson,  Jr.,  R.L.  Kosut,*  and  M.A.  Poubelle, 
“How  Exciting  Can  a  Signal  Really  Be?”,  to  appear,  Systems  and  Control  Letters. 

R.L.  Kosut,*  B.D.O.  .Anderson,  and  I.M.Y.  Mareels,  “Stability  Theory  for  Adaptive  Systems: 
Methods  of  Averaging  and  Persistency  of  Excitation”,  IEEE  Trans,  on  Aut.  Contr.,  to 
appear,  Jan.  1987. 

B.D.O.  Anderson,  R.R.  Bitmead,  C.R.  Johnson,  Jr.,  and  R.L.  Kosut,  “Stability  Theorems  for  the 
Relaxation  of  the  SPR  Condition  in  Hyperstable  Adaptive  Systems”,  IEEE  Trans,  on  Aut. 
Contr.,  submitted. 

R.L.  Kosut  and  C.R.  Johnson,  Jr.,  “An  Input-Output  View  of  Robustness  in  Adaptive  Control”, 
Automatica:  Special  Issue  on  Adaptive  Control,  20(5):569-581,  Sept.  1984. 

R.L.  Kosut,^  and  B.  Friedlander,  “Robust  Adaptive  Control:  Conditions  for  Global  Stability”, 
IEEE  Trans,  on  Aut.  Contr.,  AC-30(7):610-624,  July  1985. 

1.5.2  Books 

B.D.O.  Anderson,  R.R.  Bitmead,  C.R.  Johnson,  Jr.,  P.V.  Kokotovic,  R.L.  Kosut,  I.M.Y.  Mareels, 
L.  Praly,  and  B.D.  Riedle,  Stability  of  Adaptive  Systems:  Passivity  and  Averaging  Analysis, 
MIT  Press,  1986. 

R.L.  Kosut,  “Methods  of  Averaging  for  Adaptive  Systems”,  Adaptive  Systems:  Theory  and  Ap¬ 
plications,  Editor:  K.S.  Narendra,  Plenum  Press,  1986. 

R.L.  Kosut  and  M.G.  Lyons,  “Issues  in  Control  Design  for  Large  Space  Structures”,  Adaptive 
Systems:  Theory  and  Applications,  Editor:  K.S.  Narendra,  Plenum  Press,  1986. 

'Research  performed  while  R.L.  Kosut  was  a  Visiting  Fellow  at  the  Australian  National  University, 

^Started  under  contract  F4920-81-C-0051. 
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R.L.  Kosut,  “Adaptive  Control  of  Large  Space  Structures:  Uncertainty  Estimation  and  Robust 
Control  Calibration”,  Large  Space  Structures:  Dynamics  and  Control,  Editors:  S.N.  Atluri 
and  A.K.  Amos,  Springer- Verlag,  1987. 

E.  Crawley,  R.L.  Kosut,  and  S.  Hall,  Controlled  Structures,  in  preparation. 


1.5.3  Other  Related  Publications 

A.M  PascoaJ,  R.L.  Kosut,  D.  Meldrum,  M.  Workman,  and  G.F.  Franklin,  “Adaptive  Time-Optimal 
Control  of  Flexible  Structures”,  to  appear,  Proc.  1988  ACC,  Pittsburg,  PA,  June  1989. 

M.L.  Workman,  R.L.  Kosut,^  and  Franklin,  “Adaptive  Proximate  Time-Optimal  Servomecha¬ 
nisms:  Continuous-Time  Case”,  Proc.  ACC,  pp.589-594,  June  1987,  Minneapolis,  MN. 

M.L.  Workman,  R.L.  Kosut^,  and  Franklin,  “Adaptive  Proximate  Time-Optimal  Servomecha¬ 
nisms:  Discrete-Time  Case”,  Proc.  CDC,  Dec.  1987,  Los  Angeles,  CA. 

R. L.  Kosut, ”*  A.  Pascoal,  S.  Morrison,  and  M.L.  Workman,  “Time-Optimal  Control  of  Large  Space 

Structures”,  Proc.  SPIE,  Jan.  1988,  Los  Angeles,  CA. 

S.  Philips,  R.L.  Kosut^,  and  G.F.  Franklin,  “An  Averaging  Analysis  of  Discrete-Time  Indirect 

Adaptive  Control”,  to  appear,  Proc.  ACC,  June  1988,  Atlanta,  GA. 


^Research  supported  partly  by  NSF  Industry/  University  Cooperative  Research  Program  under  Grant  ECS- 
8605646. 

^Resarch  supported  by  SDIO/IST  and  managed  by  AFOSR. 
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Chapter  2 


Overview  of  Research  Activities 


A  new  objective  for  this  years  work  is  the  development  of  mathematical  and  computational  tools  for 
the  analysis  and  synthesis  of  adaptive  control  of  nonlinear  flexible  systems.  With  few  exceptions, 
practically  ail  of  recent  research  in  adaptive  control  is  for  the  case  of  adaptive  /mear  systems,  i.e., 
when  the  adaptive  parameters  are  held  fixed,  the  resulting  system  is  linear  and  time-invariant.  The 
research  issues  we  discuss  here  arise  not  only  from  the  nonlinearity  introduced  by  the  adaptation, 
but  the  nonlinearities  present  in  the  mechanical  system  itself.  Both  of  these  types  of  nonlinearities 
can  produce  bifurcations  and  chaotic  effects.  The  principal  aims  of  the  this  research  include  un¬ 
derstanding  the  causes  of  these  phenomena,  how  to  prevent  them  from  occurring,  how  to  minimize 
any  deleterious  effects,  and  how  to  recog.iize  them  from  measurements. 

Nonlinearities  in  flexible  mechanical  systems  arise  from  a  number  of  sources  including; 

•  actuator  saturation 

•  friction  in  bearings  or  gear  trains 

•  backlash  due  to  gear  spacing  or  slippage 

•  kinematic  transformations 

•  aerodynamic  forces 

Each  of  these  types  of  nonlinearities  can  influence  the  design  of  the  controller.  For  example, 
efficient  use  of  actuator  authority  in  the  presence  of  saturation  motivates  time-optimal  control. 
Friction  can  be  offset  by  either  direct  cancelation  or  dither.  The  effects  of  backlash  can  be  handled 
with  a  controller  that  includes  a  pre-load.  Nonlinearities  arising  from  kinematic  transformatioms 
or  aerodynamic  forces  can  be  accomodated  by  using  a  controller  that  either  incorporates  gain 
scheduling  as  a  function  of  equilibrium  points,  or  else  provides  a  "feedback  linearization”. 

In  a  general  way  the  control  structure  depends  on  the  type  of  plant  nonlinearity  and  some 
associated  parameters.  In  many  cases  the  type  of  nonlinearity  as  well  as  these  parameter  values 
are  not  very  well  known  and  prior  knowledge  is  too  coarse  to  guarantee  acceptable  closed-loop 
performance.  Under  these  conditions,  it  is  necessary  to  adapt  the  controller  to  existing  conditions. 

An  adaptive  control  system  can  adjust  to  uncertainty  by  tuning  controller  parameters  in  such 
a  way  as  to  minimize  model  parameter  inaccuracies  as  we’l  as  nullifying  the  effect  of  inaccurately 
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(or  even  unknown)  nonlinearities.  There  are  many  ways  to  design  or  configure  an  adaptive  control 
system.  A  “generic”  configuration  for  an  adaptive  control  system  is  depicted  in  Figure  2.1.  The 
control  system  consists  of  two  processes  which  make  it  adaptive,  namely:  (1)  a  model  para^jeter 
estimator  (PE),  and  (2)  a  control  design  rule  (D). 


Figure  2.1:  Generic  Adaptive  Control  System 

The  parameter  estimator  (PE)  operates  on  the  input-output  data  obtained  from  measurements 
(t/,u)  of  the  plant  system  (P)  and  produces  a  model  parameter  estimate  9  G  IR,'’.  The  parameter 
estimate  is  transformed  by  the  control  design  rule  (D)  into  a  controller  parameter  p  G  IR^,  which 
is  then  used  in  a  pre-determined  parametric  controller  structure  (C)  in  feedback  with  the  actual 
system  (P). 

In  general,  it  is  very  ea.sy  to  construct  an  adaptive  system:  just  connect  a  design  rule  and  an 
estimator  together.  However,  it  is  very  difficult  to  insure  that  the  resulting  adaptive  system  will 
provide  acceptable  performance.  This  has  been  the  goal  of  research  in  this  area  for  over  30  years. 


2.1  History  of  Adaptive  Control  Approaches 


Over  the  years,  the  application  of  adaptive  control  has  not  met  with  a  uniform  success.  A  good  ex¬ 
ample  is  provided  by  a  brief  history  of  the  adaptive  control  technique  developed  by  Whitaker(1959), 
later  referred  to  as  the  “M.I.T.-Rule”.  The  basic  idea  is  to  adjust  some  controller  parameters  in 
accordance  with  an  instantaneous  gradient  descent  of  the  squared  error  between  the  plant  output 
and  the  output  of  a  so-called  reference  model.  Some  early  applications  to  flight  control  were  re¬ 
ported  to  be  quite  successful  e.g.,  [VVhitaker(1959)]  and  [Osborne  et  al.  (1961)].  Later  reported 
applications-again  to  flight  control-  were  not  deemed  successful,  e.g.,  [Bryson(1977)].  At  the  same 
time  as  these  mixed  results  were  being  obtained  in  the  area  of  flight  control  applications,  several 
researchers  were  reporting  good  results  with  applications  in  process  control,  engine  control,  and 
ship  steering,  e.g.,  [Draper  and  Li(1951),  Astrom  et  al.  (1965,1973)].  Probably  because  of  these 
mixed  results  and  lack  of  theory,  other  approaches  to  adaptive  control  were  explored. 
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Analysis  via  Passivity 


One  such  avenue  was  announced  in  a  paper  by  Parks(1966),  where  the  M.I.T.-Rule  was  “re¬ 
designed”  in  such  a  way  that  the  resulting  adaptively  controlled  system  had  a  guaranteed  stability 
property.  The  idea  is  to  select  the  reference  model  so  that  a  certain  (closed-loop)  transfer  function 
has  a  passivity  property,  specfically,  strictly  positive  real  (SPR).^  With  only  this  SPR  information 
about  this  transfer  function,  the  adaptive  parameters  can  always  be  adjusted  towards  the  minimum 
of  some  objective  function.  This  “SPR- Rule”  idea  was  developed  later  for  more  general  situations, 
e.g.,  Monopoli(1974),  Egardt(1979),  and  Narendra,  Lin,  and  Valavani(1980).  Further  e.vtentions 
and  e.xpositions  along  the  SPR  line  can  be  found  in  Landau(1979),  Goodwin  and  Sin(1983),  and 
Kosut  and  Friedlander(1985). 


Effect  of  Unmodeled  Dynamics 

Unfortunately,  these  SPR- Rule  algorithms  have  e.xactly  the  same  stability  problems  as  the  MIT- 
Rule  algorithms,  but  only  when  unmodeled  dynamics  and  disturbances  are  taken  into  account. 
Altliough  an  early  account  was  given  by  Egardt(1978),  this  issue  was  perhaps  not  fuUy  realized  until 
the  appearance  of  the  work  of  Rohrs  et  al  (1982),  which  vividly  demonstrated  the  potentially  local 
unstable  nature  of  adaptive  systems  by  posing  a  “counter-example”  representative  of  a  non-ideal, 
but  practical,  circumstance.  The  adaptive  parameters  exhibited  the  characteristic  rapid  transient 
followed  by  a  steady  parameter  drift.  In  this  case,  however,  the  parameters  did  not  settle  down  in 
the  constant  parameter  stability  set.  Thus,  once  the  parameters  drifted  outside  the  stability  set, 
the  states  of  the  controlled  system  became  exceedingly  large. 


Cause  of  Parameter  Drift  Instability 

In  scanning  the  literature,  the  earliest  reference  which  contains  a  rigorous  analysis  of  this  instability 
phenomena  appears  to  be  in  a  note  by  James(1971).  A  Floquet  analysis  is  applied  to  an  MIT- Rule 
one-parameter  linear  adaptive  system  (adaptive  feedforward  only)  with  periodic  inputs.  The  result 
is  a  complex  stability-instability  boundary  revealing  multiple  resonance  phenomena,  precisely  like 
those  associated  with  the  Mathieu  equation,  e.g.,  Hale(1969).  No  analysis  was  undertaken  for  the 
case  with  adaptive  feedback,  where  the  system  then  is  nonlinear. 

The  work  of  Ljung(1977)  which  utilizes  a  technique  of  stochastic  averaging  can  perhaps  be 
said  to  herald  the  beginning  of  a  nonlinear  analysis  technique,  but  its  applicability  to  adaptive 
systems  in  non-ideal  situations  was  not  fully  explored  at  that  time.  Lyapunov  techniques,  explored 
by  Anderson  and  Johnstone(1983)  and  loannou  and  Kokotovic(1983),  showed  that  persistently 
exciting  signals  are  required  to  provide  a  uniform  asymptotic  stability  of  the  ideal  adaptive  system, 
and  this  in  turn  provides  robustness  to  various  unmodeled  dynamics  and  disturbances.  Input- 
output  formulations  providing  a  local  stability  were  developed  in  Kosut  and  Johnson(1984)  and 
Kosut  and  Anderson(1986);  these  also  needed  persistent  excitation.  In  none  of  these  examinations 
of  the  stability  properties  of  adaptive  control  systems  was  the  precise  mechanism  for  the  parameter 
drift  instability  identified. 


'Roughly,  a  transfer  function  G(s)  is  SPR  if  it  is  stable,  and  for  some  constant  e  >  0,  Re[G(ju;  -  t))  >  0,  Vw. 
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Method  of  Averaging 


The  first  insights  came  in  a  series  of  papers  by  Astrom  (1983,1984),  which  provided  an  analysis  based 
on  the  classical  method  of  averaging  of  Bogoliubov  and  Mitropolski(1961).  The  method  requires 
slow  adaptation  and/or  small  signal  magnitudes,  and  under  these  conditions  Astrom  was  able  to 
identify  the  source  of  the  slow  drift  instability  mechanism.  This  was  made  precise  in  a  paper  by 
Riedle  and  Kokotovic(1984,1985),  where  they  established  an  average  SPR  condition  which  provides 
a  sharp  stability-  instability  boundary,  again  in  the  case  of  slow  adaptation  with  periodic  inputs, 
but  only  for  the  linearized  adaptive  system.^  Extentions  of  this  important  result  to  other  than 
periodic  signals  and  the  relation  to  persistent  excitation  and  unmodeled  dynamics  is  provided  in 
Kosut,  Anderson,  and  Mareels(1986).  Extending  the  method  of  averaging  to  the  full  adaptive 
feedback  case  has  also  been  developed,  e.g.,  Riedle  and  Kokotovic(l986),  Kosut(1986,1987),  and 
Bodsen  ei  al.  (1985). 

During  this  period  many  of  the  researchers  involved  worked  more  or  less  in  consort.  A  summary 
of  these  early  efforts  is  contained  in  Stability  of  Adaptive  Systems:  Passivity  and  Averaging  Analysis 
(MIT  Press)  by  Anderson  et  al.  (1986).  The  material  in  the  text  represents  some  modifications 
and  refinements  of  the  earlier  work  mentioned  above. 


Present  Status 

In  summary,  we  now  understand  that  the  primary  cause  of  poor  performance  in  adaptive  systems  is 
that  the  true  plant  system  is  not  in  the  parametric  model  set  upon  which  the  parameter  estimator 
is  designed.  At  best,  the  underlying  model  set  is  an  approximation,  perhaps  only  valid  over  a 
limited  range  of  dynamics.  Hence,  the  states  of  the  parameter  estimator  cannot  be  separated  from 
the  states  of  the  plant  system.  In  consequence,  adaptive  algorithms  derived  from  the  passivity 
based  theory,  which  requires  this  strict  separation,  can  be  very  sensitive  to  even  slight  unmodeled 
disturbances  or  high  frequency  dynamics. 

At  present,  there  are  basically  two  research  “schools”,  namely:  (1)  adapt  slowly  and  insure 
robustness,  and  (2)  develop  adaptive  algorithms  which  are  inherently  robust.  In  our  research  under 
AFOSR  funding  we  have  concentrated  on  both  of  these.  To  accomplish  (1)  we  have  applied  and 
extended  the  method  of  averaging  analysis  which  leads  to  the  frequency  domain  average  SPR  condi¬ 
tion  mentioned  before.  To  accomplish  (2)  we  have  developed  what  we  call  “uncertainty  estimation” 
[cf.  Kosut(  1987b)].  The  idea  is  to  replace  the  model  estimator  in  Figure  2.1  with  one  that  produces 
an  estimate  together  with  a  measure  of  uncertainty.  The  control  design  rule  is  replaced  with  a  rule 
which  accepts  a  set  of  uncertainty  for  design.  These  approaches  to  (1)  and  (2)  have  been  developed 
for  linear  systems  in  general  and  space  structures  in  particular.  As  part  of  our  future  research,  we 
plan  to  extend  these  techniques  to  nonhnear  flexible  systems. 

In  the  next  few  sections  we  will  show  how  we  plan  to  develop  these  methods  to  further  our 
understanding  in  the  case  of  nonlinear  flexible  systems.  We  first  dicsuss  the  method  of  averaging 
in  Section  2.2  and  then  uncertainty  estimation  in  Section  2.3. 


^The  average  SPR  condition  is  significantly  less  restrictive  than  the  usual  SPR  condition  because  the  latter  requires 
that  a  certain  closed-loop  transfer  function  be  positive  at  all  frequencies,  whereas  the  former  depends  also  on  a  signal 
spectrum  and  requires  that  the  energy  at  those  frequencies  where  the  transfer  is  positive  should  dominate  those 
frequencies  where  it  is  negative. 
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2.2  Averaging  Analysis  of  Adaptive  Control  Systems 

In  this  section  we  review  the  clcissical  method  of  averaging,  review  the  current  status  with  respect 
to  adaptive  control  of  linear  systems,  and  indicate  how  we  plan  to  extend  the  idezis  to  nonlinear 
systems.  We  also  provide  a  motivating  example  simulation  of  an  adaptively  controlled  nonhnear 
system  exhibiting  chaos. 

2.2.1  Classical  Method  of  Averaging 

The  classical  method  of  averaging  analysis  considers  systems  of  the  form 

x  =  'ff(t,x)  (2.1) 

where  7  is  a  positive  constant.  Under  suitable  smoothness  conditions  the  solutions  of  the  above 
system,  for  ah  small  7  >  0,  are  approximated  to  order-7  by  solutions  of  the  “averaged  system” 

y  =  lf{y)  (2.2) 

where  f{y)  is  the  average  over  t  of  f(t,y)  for  fixed  y,  assuming  the  average  exists.  That  is, 

Ry)  =  ^  fit,  y)  dt  (2.3) 

The  details  on  averaging  theory  including  the  smoothness  conditions,  averageable  functions,  etc.,  are 
contained  in  many  textbooks,  e.g.,  Bogoliuboff  and  Mitropolskii  (1961),  Hale(1969),  Sanders  and 
Verhulst(1987).  The  usefulness  of  averaging  is  that  the  averaged  system,  although  still  nonlinear, 
is  simpler  to  analyze.  For  example,  let  y.  denote  an  equilibrium  of  the  averaged  system,  that  is,  y, 
satisfies 

S{y>)  =  0 

If  y.  is  a  stable  equilibrium  of  the  averaged  system,  then  there  is  a  stable  solution  x,(t)  of  the 
original  system  (2.1)  which  is  contained  in  an  order-  7  neighborhood  or  orbit  of  y,.  Thus,  the 
original  system  inherits  the  stability  of  the  averaged  system,  which  is  eaisily  determined  from  a 
linearization  about  the  equilibrium  y..  In  addition,  the  region  of  attraction  of  the  averaged  system 
to  the  equilibrium  y.  is  to  within  order  -7  of  the  region  of  attraction  of  the  original  system  to  the 
stable  orbit  i.(t). 


2.2.2  Current  Status:  Slowly  Adapting  Linear  Systems 

The  one  difficulty  in  directly  applying  the  clcissical  method  of  averaging  to  adaptive  systems  is  that 
they  do  not  fit  the  standard  form  of  (2.1).  Specifically,  the  system  of  Figure  2.1,  as  well  as  most 
adaptive  control  systems,  can  be  represented  by  the  set  of  coupled  ordinary  differential  equations: 
3 

X  =  git,x,9)  (2.4) 

e  =  7g(t,x,^)  (2.5) 

^Since  dynamical  systems  are  controlled  by  a  digital  computer,  by  rights  we  should  use  difference  equations  to 
describe  the  parameter  adaptation.  Nonetheless,  we  use  (2.5)  to  Ulustrate  the  main  ideas,  which  for  the  most  part 
carry  forward  to  the  discrete-time  case  [c/.  Ch.  5,  Anderson  et  al.  (1986)]. 
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In  (2.4),  X  G  IR"  is  referred  to  as  the  ’’state”  and  consists  of  dynamical  states  of  the  system  being 
controlled,  controUei  states,  and  filter  states  in  the  parameter  estimator.  The  function  g{t,x,6)  is 
determined  by  the  system  dynamics  and  the  controller/estimator  design.  In  (2.5),  6  G  IR'’  is  the 
’’parameter  estimate”  whose  rate  of  change  is  governed  by  a  scalar  constant  7  >  0  and  a  function 
q{t,  X,  6)  determined  by  the  designer.  Observe  that  when  the  parameters  are  held  fixed,  the  system 
is  governed  by  the  nonlinear  system  (2.4)  for  constant  parameter  B. 

In  the  case  of  an  adaptive  linear  system,  (2.4)-(2.5)  reduce  to. 


X  =  A{6)x  +  B{d)w{t)  (2.6) 

6  =  ■yq(t,x,9)  (2.7) 


where  w{t)  G  IR'"  consists  of  all  exogenous  inputs  such  as  references  and  disturbances,  and  A{6)  G 
IR”’‘"  and  B{0)  G  IR’*^’”  are  matrix  functions  of  the  parameter  B,  the  specific  function  depending  on 
the  control  design  rule  and  the  parameter  estimation  model  set.  In  this  case  when  the  parameters 
are  held  fixed,  the  system  is  governed  by  the  LTI  system  (2.6)  with  B  constant. 

Following  Anderson  et  al.  (1986),  to  apply  the  method  of  averaging  to  the  adaptive  linear 
system  (2.6)-(2.7) ,  we  make  the  assumption  that  the  adaptation  gain,  7,  is  small.  This  assumption 
corresponds  to  slow  adaptation,  i.e.,  B(i)  is  slowly  changing.  To  further  facilitate  the  analysis  a 
time  scale  decomposition  follows  giving  rise  to  new  differential  equations  that  are  coupled  in  a 
simpler  way.  First,  freeze  B(t)  at  a  constant  value  B,  and  let  x(t,B)  denote  the  solution  to  the 
linear,  constant  system 

X  =  A{B)x  +  B{B)w{t) 

6  =  0 


called  the  “frozen  system”.  Observe  that  x{t,  0)  is  a  function  of  any  vector  6  and  a  scalar  t.  Hence, 
we  may  define  the  error  state 

ri{t)  =  x{t)- x(t,9{t))  (2.9) 

The  adaptive  linear  system  (2.6)-(2.7)  is  then  equivalent  to 

6  =  'liq{t,x{t,6)  +  Vyff)  (2.10) 

dx 

77  =  A{9)r}-y—{t,B)q{t,x{i,6)i-T},B)  (2.11) 


It  is  convenient  to  think  of  77  as  the  "fast”  states,  particularly  when  7  is  small.  Thus,  (2.10)-(2.11) 
is  a  time-scale  decomposition  of  (2.6)-(2.7) .  Now,  if  the  fast  states  asymptotically  decay,  then  6(t) 
approaches  solutions  of 

L  = 'fqit,xit,9,r,),6„)  (2.12) 

which  is  in  precisely  the  form  of  (2.1),  and  hence,  suitable  for  averaging.  Thus,  the  "averaged 
parameter  system”  is 

9a  =  7/(^a)  (2.13) 


where 


f{B)  =  lim 

i  — *00 


T 


q{t,x{t,6),B)dt 


(2.14) 


The  important  results  are  roughly  as  follows: 

(i)  The  parameter  trajectories  of  the  adaptive  linear  system  (2.6)-(2.7)  are  to  within  an  order-7 
of  the  solutions  of  the  averaged  system  (2.13). 
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(ii)  The  stability  of  the  adaptive  linear  system  (2.6)-(2.7)  is  inherited  from  the  stability  of  the 
averaged  system  (2.13). 

(iii)  A  stable  equilibrium  of  the  averaged  system  (2.13)  has  a  large  region  of  attraction  within 
the  constant  parameter  stability  set  associated  with  the  frozen  parameter  system  (2.8),  i.e., 
those  fi.xed  0  for  which  A{d)  is  a  stability  matrix. 

These  results  have  a  far  reaching  impact  on  quantifying  engineering  guidelines  for  adaptive 
control  system  design  of  actual  systems.  In  the  first  place,  they  provide  a  sharp  stability-instability 
boundary.  Secondly,  although  it  is  not  so  apparent  from  this  limited  exposition,  the  conditions  for 
stabilty  of  the  equilibrium  of  the  averaged  system  can  be  formulated  in  the  frequency  domain.  It 
then  becomes  very  clear  as  to  the  relation  between  excitation  and  system  dynamics,  e.g.,  [Ander¬ 
son  et  al.  (1986)],[Kosut  et  al.  (1986)]. 

In  light  of  the  averaging  results  it  is  now  possible  to  more  fully  understand  the  conditions  for  local 
stability  and  perhaps  more  importantly,  the  causes  and  mechanisms  for  instability.  In  summary, 
the  method  of  averaging  provides  a  means  to  assess  the  stability  and  instability  properties  of  slow 
adaptive  linear  systems.  There  are  of  course  some  restrictions,  the  obvious  one  being  the  need  for 
slow  adaptation.  More  importantly,  how  slow  is  slow?  The  answer  is  not  well  quantified  at  the 
present  time. 

To  remove  the  restrictiveness  of  slow  adaptation  will  require  a  better  understanding  of  the 
transient  and  bifurcating  behavior  of  the  adaptive  system.  Some  preliminary  results  on  the  tran¬ 
sient  behavior  is  presented  in  Kosut  and  Bitmead(1986,  1987).  For  adaptive  linear  systems,  when 
the  adaptation  gain  7  is  incresed,  bifurcation  and  chaotic  phenomena  have  been  observed,  e.g., 
[Mareels  and  Bitmead(1986,1988)],  [Salam  and  Bai(1988)],  [Cyr  et  al.  (1983)],  [Ydstie(1986)],  and 
[Schoenwald  et  al.  (1987)].  In  these  studies,  the  underlying  system  is  linear.  Thus,  rapid  adaptation 
can  induce  bifurcating  and  chaotic  behavior  even  in  linear  systems.  Even  under  slow  adaptation, 
such  phenomena  will  appear  if  the  system  being  controlled  is  nonlinear,  as  seen  in  the  example 
simulation  to  follow. 

2.2.3  A  Next  Step:  Slowly  Adapting  Nonlinear  Systems 

As  outlined  above,  the  method  of  averaging  so  far  has  been  extended  and  applied  only  for  adaptive 
linear  control  systems,  that  is,  if  the  adaptive  parameters  were  held  fixed,  then  the  resulting  closed- 
loop  system  would  be  linear.  This  is  certainly  not  the  case  for  flexible  mechanical  systems  which 
have  significant  friction  and  backlash  as  well  as  actuator  saturation.  Also,  during  rapid  slewing 
maneuvers,  the  effect  of  the  nonlinear  kinematics  is  apparent.  Although  the  method  of  averaging 
can  handle  nonlinear  systems  in  theory,  see,  e.g.,  Hale(1969),  the  application  to  adaptive  nonlinear 
control  remains  an  open  area  for  basic  research. 

In  principal,  some  of  the  linearization  type  results  in  Anderson  ei  al.  (1986)  will  undoubtedly  be 
applicable,  modulo  some  technical  conditions.  There  is  some  early  work,  e.g.,  Volosov(1962),  which 
may  be  more  directly  applicable.  However,  unlike  the  adaptive  linear  system,  in  the  nonlinear  case 
the  system  may  exhibit  bifurcations  and  chaotic  behavior  even  though  the  adaptation  is  slow.  It  is 
these  type  of  issues,  which  are  intrinsically  nonlinear,  that  appear  as  the  next  step  in  providing  a 
more  complete  and  realistic  picture  of  the  use  and  limitations  of  parameter  adaptive  control. 

An  example  is  presented  in  the  next  section  which  illustrates  some  of  these  isaues. 
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2.2.4  Example:  Adaptive  Control  of  a  Chaotic  System 

To  provide  motivation  for  studying  adaptive  control  of  nonlinear  flexible  systems  we  will  present 
an  example  simulation  which  vividly  illustrates  the  research  issues.  Consider  the  Duffing  system: 

X  +  Ai  +  =  tfr  (2.15) 

where  A:  >  0  is  a  small  damping  coefficient,  the  cubic  term,  i^,  represents  a  nonlinear  spring,  0  is  a 
feedforward  control  gain,  and  r  is  a  reference  command  which  is  to  be  followed  by  the  displacement 
X.  Prior  knowlege  of  the  damping  and  nonlinear  spring  term  is  assumed  unavailable,  and  hence, 
the  the  control  parameter  B  wiU  be  adapted  as  follows: 

B  =  7r(r  -  x)  (2.16) 

The  constant  7  >  0  as  before  is  used  to  adjust  the  speed  of  adaptation.  This  adaptation  rule  is  a 
so-called  “gradient”  rule  (sometimes  referred  to  as  the  ”M.I.T.-Rule”,  [Whitakar(1959)])  because 
the  rate  of  adjustment  is  proportional  to  the  negative  gradient,  with  respect  to  the  parameter  fl,  of 
an  instantaneous  error  function.  In  this  case 

where  tp  is  the  instantaneous  gradient  of  x,  that  is, 


Thus,  for  constant  0,  tp  satisfies  the  differential  equation 

ktp  Zx^ip  =  r 

Since  rp  depends  on  the  unknown  damping  and  cubic  nonlinearity  in  the  system  (2.15),  the  pure 
gradient  rule  6  =  7V'(r  -  x)  cannot  be  implemented.  Using  the  (crude  but  simple)  approximation 
-ip  K  r  yields  the  algorithm  of  (2.16). 

Before  studying  the  adaptive  system  (2.15)-(2.16)  ,  recall  that  the  Duffing  system  (2.15)  for 
constant  9  has  been  extensively  examined.  In  particular,  Ueda(1980)  made  an  exhaustive  study 
with 

dr  =  B  cos  t 

and  tabulated  the  resulting  long-term  behavior  as  a  function  of  the  parameters  {k,B)  as  shown  in 
Figure  2.2. 

For  example,  with  {k,B)  =  (.08,  .2),  there  are  five  coexisting  periodic  attractors  as  shown  in 
Figure  2.3.  For  {k,  B)  =  (.05, 7.5),  (.25, 8.5),  or  (.1, 12.),  the  attractors  are  all  chaotic,  as  seen  from 
the  (x,  x)-plane  Poincare’  sections  at  the  strobe  times  t  €  {2irA; :  fc  =  1, . . .  ,4250}  in  Figure  2.4. 
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Figure  2.2:  Regions  of  long  term  behavior  of  Duffing  system  in  the  {k,  B)  plane.  From  Ueda(1980) 
as  reprinted  in  Thompson  and  Stewart(1986  ). 
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Figure  2.3:  Samples  of  periodic  and  multi-periodic  long-term  behavior  of  the  Duffing  system  under 
sinusoidal  forcing.  From  Thompson  and  Stewart(1986). 


Figure  2.4:  Poincare’  sections  in  the  (a:,i)  plane  for  three  of  Ueda’s  chaotic  attractors.  From 
Ueda(1980)  as  reprinted  in  Thompson  and  Stewart(1986). 
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Given  the  quite  complex  behavior  of  the  system  (2.15)  when  9  is  constant  and  r  is  a  sinusoidal 
reference,  it  is  to  be  expected  that  if  6  is  slowly  adapted,  then  the  system  (2.15)-(2.16)  will  pass 
through  regions  which  contain  chaotic  and/or  periodic  attractors. 


Simulation  Results 


Simulations  of  (2.15)-(2.16)  were  performed  under  the  following  conditions: 


reference 


r  =  .1  cost 


damping 


k  =  .05 


adaptation  gain 


7  =  S 


0 

.01 

.05 


0  <  t  <  25Q{2r) 
250(27r)  <  t  <  750(2;r) 
750(27r)  <  t  <  4250(27r) 


initial  conditions 

=  (75,0,0) 

integration  algorithm 

4th-order  Runge-  Kutter  integration  algorithm  with  a  fixed  step-size  equal  to  .005  of  the 
period  of  the  reference  signal  r,  t.e.,  step-size=(.005)(27r). 


To  initialize  the  system  there  is  no  adaptation  (7  =  0)  for  the  first  250  periods.  This  is  followed 
by  500  periods  of  very  slow  adaptation  (7  =  .01),  and  then  the  remainder  of  the  time  at  relatively 
slow  adaptation  (7  =  .05).  The  adaptive  gain,  7,  was  chosen  to  produce  slow  adaptation  to  prevent 
any  possible  bifurcations  and  chaos  soley  as  a  result  of  too  rapid  an  adaptation,  which,  as  previously 
mentioned,  is  known  to  occur  in  adaptive  systems  even  when  the  system  being  controlled  is  linear. 
Thus,  initially  after  the  adaptation  begins. 


9r  «  7.5  cos  t 

Referring  to  Figure  2.4,  the  initial  response  of  (2.15)-(2.16)  will  be  chaotic.  What  we  hope  to  see 
is  that  the  adaptation  brings  the  system  to  a  less  violent  condition,  which  in  fact  is  what  occurs. 
Figures  2.5-  2.7  show  the  results  of  the  simulation  over  4250  periods  of  the  reference,  t.e.,  for 
0  <  t  <  4250(27r). 

Figure  2.5  shows  values  of  x  and  9  at  the  l-period  strobe  times  t  G  {2-Kk  :  k  =  1,...,4250}. 
Figure  2.6  shows  the  {x,9)  Poincare’  section  at  these  strobe  times.  Observe  that  as  is  adapted, 
the  system  passes  in  and  out  of  chaotic  and  periodic  behavior.  Specifically,  reading  Figure  2.5  or  2.6 
from  left  to  right:  (initially)  chaotic  — ►  3-periodic  -+  chaotic  2-periodic  — >  1-periodic  — >  chaotic 
— »  3-periodic  -»■  (finally)  1-periodic.  Despite  this  incredibly  complex  behavior,  the  adaptation  of 
9  continues  relentlessly  to  monitonically  decrease  the  parameter  value  until  a  good  steady  state  is 
reached.  In  fact,  some  simulations  with  9  constant  (not  shown  here)  verify  that  the  final  steady- 
state  adaptive  parameter  is  very  near  to  the  constant  value  of  9  minimizing  average  (i  -  r)^.  This 
is  precisely  the  desired  property  of  the  so-called  gradient  algorithm  (2.16).  Figure  2.7  shows  some 
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samples  of  time  histories  of  {9,  x,  i)  vs.  t  for  5  periods  of  the  reference,  and  also  phase  plots  of  x 
vs.  X  for  Uie  same  5  periods.  Comparison  of  the  initial  chaotic  phase  plot  with  the  final  phase  plot 
shows  a  significant  improvement  in  performance  in  terms  of  rms  error  between  x  and  r  as  well  as 
mildness”  of  response. 


e  500  1000  15CQ  2000  2500  3000  3500  <1000  <1500 

periods 


Figure  2.5:  Values  of  x  and  9  at  1-period  strobe  times. 
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Figure  2.6:  Upper.  Poincare’  section  in  {x,9).  Lower  Expanded  scale  of  upper  at  the  end  of  the 
simulation. 
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Figure  2.7:  Samples  of  (i,  x)  phase  plots  and  corresponding  time  histories  of  {0,x,x)  during  the 
simulation. 
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2.2.5  Implications  for  Future  Research  Directions 

The  simulation  results  are  very  encouraging  and  strongly  support  the  use  of  adaptive  control  to 
minimize  the  potentially  deleterious  effects  of  uncertain  nonlinearities.  It  is  also  encouraging  that 
the  adaptive  algorithm  (2.16)  is  very  simple  to  construct  and  implement.  A  basic  question  which 
arises  from  the  simulation  study  is  simply: 

Why  does  the  adaptation  work? 

In  fact,  as  seen  in  Figures  2.5  and  2.7,  the  magnitude  of  9  decreases  monitonically  at  an  almost 
uniform  rate,  seemingly  independent  of  whether  the  system  is  chaotic  or  periodic.  Moreover,  at  the 
end  of  the  simulation,  as  seen  in  the  lower  plot  of  Figure  2.6,  the  parameter  exponentially  spirals 
towards  the  final  steady-state  shown  in  the  bottem  time  history  of  Figure  2.7. 

At  the  present  time  the  exact  mechanism  or  conditions  which  give  rise  to  this  pleasing  perfor¬ 
mance  are  not  known.  Understanding  the  underlying  principals  is  a  major  goal  of  our  research 
program.  Given  that  slow  adaptation  is  at  least  prudent,  we  will  certainly  consider  this  situation 
in  our  future  efforts,  and  along  these  lines  we  can  appeal  to  the  method  of  averaging  analysis  which 
we  used  with  success  for  analyzing  adaptive  linear  control  systems  under  slow  adaptation. 
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2.3  Synthesis  via  Uncertainty  Estimation 


In  this  section  we  discuss  another  approach  to  adaptive  control  which  is  referred  to  as  “uncertainty 
estimation”.  This  approach  has  been  used  successfully  for  linear  systems  representing  LSS  in 
particular,  cf.  Kosut(1987b). 


Figure  2.8:  Adaptive  control  with  uncertainty  estimation. 


The  uncertainty  estimation  approach,  depicted  in  Figure  2.8,  differs  from  the  usual  adaptive 
system  of  Figure  2.1  in  a  number  of  ways: 

•  The  scheme  involves  the  on-line  identification  of  a  nonlinear  model  set,  and  then  tuning  or 
redesigning  the  controller  based  on  the  identified  model  set. 

•  The  model  set  generated  by  the  identification  process  contains  information  as  to  the  accuracy 
of  the  identified  model  set. 

•  The  control  design  rule  is  robust  with  respect  to  the  identified  model  set,  that  is,  the  closed- 
loop  system  satisfies  all  performance  requirements  for  every  plant  in  the  identified  model 
set. 

•  There  is  an  outer  loop  containing  a  “decision  rule”  as  to  when  to  use  the  model  estimate. 

This  is  a  fundamentally  different  approach  from  the  usual  adaptive  schemes  (cf.  Figure  2.1), 
where  the  identification  produces  a  single  estimated  model,  with  no  information  regarding  its 
accuracy.  In  a  sense  uncertainty  estimation  shifts  the  focus  from  the  stability  analysis  view  as 
expounded  in  Section  3,  to  the  system  identification  view. 

In  this  regard  there  are  several  basic  research  issues  which  need  to  be  resolved  for  this  approach 
to  adaptive  control  to  be  effective,  namely:  (1)  model  set  definition,  (2)  robust  nonlinear  control  de¬ 
sign  rules,  and  (3)  uncertainty  estimation  of  nonlinear  flexible  systems.  In  tin-  next  few  subsections 
we  provide  a  brief  description  of  what  is  involved  in  understanding  these  issues. 
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2.3.1  Model  Set  Definition 


The  starting  point  in  control  design  or  system  identification  is  a  definition  of  a  model  structure 
which  characterizes  the  fundamental  attributes  of  the  dynamical  system.  In  control  design  we  also 
need  a  quantitative  description  of  model  accuracy,  otherwise  there  is  no  way  to  insure  that  the 
design  is  robust.  Thus,  the  identification  process  should  produce  not  only  a  model  estimate,  but 
also  an  estimate  of  uncertainty.  In  this  task  we  will  define  a  model  set  which  captures  both  the 
dynamical  structure  as  well  as  being  able  to  describe  accuracy. 

Specifically,  we  will  begin  by  e.xamining  model  sets  with  the  following  features: 

•  Uncertain  Parameters.  A  capabihty  to  account  for  that  part  of  the  system  which  is  known 
to  be  governed  by  physical  laws  or  able  to  be  described  by  known  functions  dependent  on 
certain  constant  parameters.  The  parameters  may  only  be  known  to  lie  within  some  range  of 
variation. 

•  Uncertain  Nonlinearities.  Able  to  describe  uncertainty  in  the  nonlinearities  which  contribute 
to  the  system  dynamics.  These  nonlinearities  can  still  be  dependent  on  uncertain  parameters, 
but  here  there  structure  is  uncertain  as  well. 

•  Unmodeled  Dynamics.  Able  to  account  for  uncertain  dynamics  for  which  a  parametric  struc¬ 
ture  is  not  available  or  assumed,  e.g.,  no  good  physical  models  or  the  result  of  approximations 
such  as  neglecting  fast  dynamics  and  linearization. 

One  of  our  goals  is  to  investigate  the  use  of  model  sets  with  the  above  features  for  describing 
uncertain  nonlinear  flexible  systems.  In  a  later  section  we  describe  a  particular  model  for  the 
purposes  of  analysis.  However,  there  are  a  number  of  well  known  methods  for  describing  uncertain 
nonlinear  systems. 


Singular  Perturbations 

In  this  formulation  the  sytem  dynamics  are  decomposed  into  two  time  scales,  where  c  is  a  positive 
constant  which  reflects  the  time  scale  separation  between  the  “slow”  states  rand  the  “fast”  states  z.. 
This  form  is  known  to  be  compatible  with  typical  areospace  systems,  see,  e.g.,  Menon  et  a/.(l987). 
However,  the  singular  perturbation  parameter  e  is  not  easily  chosen  as  a  physical  parameter,  and 
there  is  some  freedom  also  in  partitioning  the  states. 

The  convenience  of  the  above  form  is  that  a  formal  theory  exists  for  separating  the  time  scales, 
e.g.,  Chow  and  Kokotovic(1978). 


Conic  Sectors 

Here  we  follow  in  the  tradition  of  Popov(1961),  Zames(1966),  Safonov  (1980,1981),  and  Doyle(1982,1984), 
to  name  a  few.  The  idea  of  conic-sector  modeling  is  quite  general  and  does  encompass  the  singular 
perturbation  approach  described  above,  however,  it  is  more  of  an  input-output  view  than  a  state- 
space  view.  The  original  ideas  by  Popov  and  Zames  have  been  generalized  by  Safonov  and  Doyle, 
particularly  for  LTI  sy terns.  A  recent  result  by  Safonov(1987)  extends  the  original  sufficient  sector 
stability  conditions  estiblished  by  Popov,  to  necessary  conditions,  but  requires  solving  a  convex 
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programming  problem.  We  hope  to  explore  this  latter  capability.  We  wiU  also  investigate  the 
optimal  scaling  methods  for  LTI  conic-sectors  established  by  Doyle(  1982,1984). 


2.3.2  Robust  Nonlinear  Control  Design 

There  are  now  several  good  methods  for  designing  nonlinear  controllers.  In  this  section  we  discuss 
a  few  that  can  be  applied  to  nonlinear  flexible  systems. 

Feedback  Linearization 

One  particular  method  which  we  will  investigate  is  the  so  called  feedback  linearization,  e.g.,  Hunt, 
Su,  and  Meyer(1983).  The  basic  idea  is  to  apply  a  nonlinear  feedback  which  brings  the  closed- 
loop  system  into  a  purely  linear  form,  which  has  obvious  advantages  for  control  design.  A  general 
discussion  of  the  theory  can  be  found  in  Isidori(19?).  A  difficulty  with  the  method  is  that  the 
linearizing  feedback  requires  an  exact  knowledge  of  the  type  of  system  nonlinearities.  Robustness 
to  model  errors  is  an  open  area  in  basic  research.  Some  work  on  robustness  has  been  advanced 
in  the  robotics  area,  where  feedback  linearization  is  there  referred  to  as  “computed  torque”,  see, 
e.g.,  Spong  and  Vidyasagar(1985).  Another  route  is  to  adapt  to  maintain  parameters  in  a  known 
nonlinear  structure,  e.g.,  Craig,  Hsu,  and  Sastry(1986),  Slotine  and  Li(1987).  The  more  practical 
adaptive  problem  would  involve  parameter  adaptation  where  the  true  system  nonlinearities  differ 
in  form  from  those  in  the  nonlinear  model  set. 


Singular  Perturbations 

The  advantage  of  the  formal  time-scale  separation  discussed  before  is  that  the  properties  of  the  full 
nonlinear  system  can  be  determined  by  analyzing  the  simpler  slow  and  fast  nonlinear  subsystems. 
Moreover,  the  slow  and  fast  controllers  can  be  separately  designed. 

The  time-scale  separation  parameter  e  is  essentially  a  measure  of  the  effect  of  the  unmodeled 
(fast)  dynamics  on  the  modeled  (slow)  part.  As  mentioned  before,  it  is  often  a  difficult  task  to 
define  e  in  terms  of  physical  quantities,  and  hence,  the  practicle  usefulness  involves  some  guesswork. 
However,  a  very  nice  property  of  the  above  approach  is  that  the  analysis  and  formalism  is  valid  for 
nonlinear  systems.  Thus,  one  can  start  the  modeling  process  with  the  full  nonlinear  model,  and 
by  introducing  e  in  the  right  place,  a  subsequently  appropriate  model  reduction  automatically  fol¬ 
lows.  We  plan  to  investigate  the  applicability  of  this  approach  to  representing  uncertain  dynamical 
systems. 


Robust  Linear  Control 

In  attempting  to  develop  robust  control  laws  for  nonlinear  systems  we  will  certainly  appeal  to  ex¬ 
isting  work  on  linear  systems.  For  example,  robustness  of  linear  feedback  systems  utilizes  frequency 
domain  expressions  for  characterizing  a  set  of  uncertainty  within  which  lies  the  true  plant,  e.g., 
Francis  and  Zames(1983),  Safonov  et  a/.(1981),  Doyle  and  Chu  (1985),  Vidyasagar(198')).  In  these 
approaches,  uncertainties  in  the  plant  model  are  described  in  the  frequency  domain,  ami  iMlnist 
stabilization  and  performance  are  guaranteed  if  the  peak  value,  as  a  function  of  freqii'  ii.  \ .  of  a 
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certain  closed  loop  transfer  function  is  sufficiently  small.  The  peak  value  turns  out  to  be  an  appro¬ 
priate  norm  in  the  space  of  transfer  functions,  referred  to  as  the  J^oo-norm.  The  design  for  robust 
stabilization  and  performance  can  then  be  converted  into  a  minimization  problem  in  the  /foo-norm. 

Robust  stabilization  of  linear  systems  in  the  presence  of  parameter  uncertainty  is  a  rather  more 
difficult  problem,  and  at  the  present  time  there  is  no  known  solution,  although  many  promising 
approaches  have  been  advanced.  Recently,  the  analysis  and  design  of  robust  control  systems  based 
on  Kharitonov’s  theorem  and  its  extensions,  has  received  considerable  attention  in  the  literature, 
e.g.,  see  Barmish  (1987),  and  Wei  and  Yedavalli  (1987).  Although  still  in  its  infancy,  this  approach 
will  eventually  lead  to  efficient  control  design  methods  that  can  handle  structured  parameter  vari¬ 
ations.  Methods  based  on  LQG  synthesis  have  also  been  advanced,  e.g.,  Tahk  and  Speyer  (1986) 
for  general  linear  systems  and  Tahk  and  Speyer(1987)  for  LSS. 

We  believe  that  some  of  the  design  methods  described  in  the  references  above  can  be  further 
developed  by  exploring  the  particular  structure  of  uncertainty  occurring  in  LSS  plants.  Further,  we 
envision  the  combination  of  different  design  methodologies  to  achieve  the  synthesis  of  controllers 
that  are  robust  with  respect  to  combined  structured  and  unstructured  uncertainty. 

2.3.3  Uncertainty  Estimation  of  Nonlinear  Systems 

Uncertainty  estimation  is  an  identification  process  which  produces  a  model  set  estimate  compatible 
with  prior  knowledge  regarding  uncertainty.  The  estimator  and  control  design  rule  use  model  sets, 
rather  than  a  single  model  estimate  as  is  usually  the  case.  These  model  sets  contain  information 
on  model  accuracy,  i.e.,  accuracy  of  parameter  estimates  and  unmodeled  dynamics. 

Tlicre  are  two  types  of  problems  in  designing  “uncertainty”  extimators: 

•  Estimation  With  No  Prior  Knowledge  of  Uncertainty 

In  this  case  there  is  no  knowledge  available  about  the  extent  of  uncertainty  in  the  unmodeled 
dynamics.  The  indentification  scheme  should  be  modified  so  as  to  produce  not  only  an 
estimated  model,  but  also  a  measure  of  model  uncertainty. 

•  Estimation  With  Prior  Knowledge  of  Uncertainty 

In  this  case  a  bound  on  the  uncertainty  of  the  unmodeled  dynamics  is  available.  The  identi¬ 
fication  scheme  should  be  modified  so  as  to  produce  a  parametric  model  which  is  as  close  to 
the  true  system  as  possible,  given  the  prior  information. 

2.3.4  Stability  Analysis 

There  is  no  general  theory  of  stability  analysis  for  adaptive  systems  which  use  an  uncertainty  esti¬ 
mator.  In  the  linear  case  there  are  some  results  based  on  least-squares  estimation,  cf.  Kosut(1988). 
For  nonlinear  systems  there  are  some  general  methods.  For  example,  one  can  mention  the  so-called 
Total  Stability  Theorem  for  dealing  with  nonlinear  differential  equations  having  the  general  form 

X  =  f{t,x)  + g{t,x) 

where  x  =  f{t,x)  is  the  “nominal”  system  and  where  g{t,x)  represents  “perturbations.”  The 
original  form  of  the  theorem  is  due  to  Malkin(1958).  Later  versions  can  be  found  in  Bellman(1953), 
Hale(1969),  and  more  recently  in  Anderson  et  al.  (1986).  Roughly,  the  Total  Stability  Theorem 
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asserts  that  if  the  nominal  system  is  stable,  and  the  perturbations  are  bounded  and  sufficiently 
smooth,  then  the  perturbed  system  is  also  stable.  One  often  tries  to  make  the  nomial  system  linear 
or  of  a  special  nonlinear  structure,  e.g.,  adaptive  in  the  ideal  case  of  no  unmodled  dynamics.  This 
is  certainly  an  avenue  which  we  will  explore. 

Another  route  is  based  on  the  structure  of  Figure  2.8.  That  is,  the  decision  rule  switch  is  open 
for  a  period  of  time  while  identification  takes  place.  That  is,  the  parameter  estimation  process  is 
sparate  focibly  from  the  control  design  process.  Thus,  the  control  parameters  are  held  fixed  at  there 
previous  values  for  a  period  of  time  while  new  values  are  learned.  For  example,  let  dk-i  denote  a 
previously  determined  parameter  value  which  is  being  held  fixed  during  the  current  observation  or 
learning  interval  f  6  [(A:  —  1)T,  fcT].  Thus,  the  adaptive  parameters  are  in  effect  determined  by  the 
iterative  map 

Ok^TkiOk-l),  k  = 

where  r;t(')  is  defined  implicitly  by 

X  =  g(t,x,ek-i) 

6  =  79(t,i,tf)  e{{k  -  l)T)  =  ek-\ 

6k  =  e{kT) 

An  interesting  research  question  is  what  happens  if  the  observation  intervals  is  long,  that  is,  7  small 
and  T  large.  From  the  previous  averaging  results  one  would  suspect  that  for  all  small  7  >  0: 

lim  lim  0k  =  6,-\-  0(7) 

T— *00  fc— »oo 

where  6.  is  a  fixed-point  of  the  map  r(0)  defined  implicitly  as  the  equilibrium  of  the  averaged  system 
in  the  following  sense: 

1  rto+T 

lim  -  /  dt  =  0 

r-*oo  1 

Some  preliminary  results  in  Phillips,  Kosut,  and  Franklin(1988)  suggest  that  r(-)  is  contractive  in 
the  constant  parameter  stability  set  if  and  only  if  6,  is  a  uas  equilibrium  of  the  averaged  system 
(2.13).  This  conjecture  would  also  tie  together  some  existing  results  on  bounding  the  actual  fixed- 
point,  see,  e.g.,  Kosut(1988). 
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2.4  Representaive  Nonlinear  Model  and  Testbed 


In  this  section  we  discuss  in  some  detail  the  type  of  nonlinear  flexible  system  to  be  adaptively 
controlled.  In  this  regard  we  first  discuss  a  project  at  ISI  which  can  provide  a  realistic  model  base 
as  well  as  an  experimental  apparatus. 


2.4.1  Adaptive  System  Testbed 

Under  an  Army  funded  program  at  ISI,  we  are  developing  an  "adaptive  system  testbed”  for  rapid 
prototyping  of  advanced  adaptive  control  algorithms.  The  objective  of  the  testbed  facility  is  to  be 
able  to  design,  develop,  test  and  validate  weapon  system  control  strategies  so  that  a  high  degree  of 
confidence  can  be  established  in  transferring  the  technology  to  test  vehicles  and  operational  systems. 
The  testbed  is  specifically  designed  for  testing  weapon  pointing  control  of  flexible  systems. 


Hardware  Configuration 

The  proposed  system  for  the  rapid  prototyping  of  control  algorithms  will  consist  of  an  Apollo 
workstation,  a  multi-processor  real-time  controller  and  the  testbed  (see  figure  2.9).  The  Apollo 
workstation  wiU  be  used  for  developing  and  simulating  the  control  algorithms,  for  generating  real¬ 
time  code,  and  for  interfacing  with  the  real-time  control  processor.  The  real-time  control  processor 
will  execute  the  code  generated  for  the  control  algorithm  and  interface  with  the  testbed. 


MATRIXx/WS 


MATRIXx 
Dloek-Dugrairn 
Autocode  Code 
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Compilation^ 

Real-Time  Inicctacc 
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Compiler  Linker 
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Port 


• 

1^ 
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1  Testbed 
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•  Storage  of  Acquired  Data 


Figure  2.9:  Adaptive  Control  Development  System 
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Software  Elements 


The  feature  making  the  Adaptive  Control  testbed  most  desirable  for  control  system  experimen¬ 
tation  is  the  ability  to  easily  and  rapidly  reprogram  the  processors.  Controller  architectures  and 
parameters  are  specified  in  block  diagram  form  at  a  high  level  using  SYSTEM-BUILD,  then  the 
SYSTEM-BUILD  diagrams  are  converted  to  real-time  control  programs  through  an  automatic  code 
generator.  This  code  is  then  compiled  and  downloaded  to  all  or  some  of  the  processors  for  real-time 
e.xecution.  The  user  can  interface  with  the  code  executing  on  the  controller  through  graphical  icons 
on  the  host  workstation. 


Mechanical  Testbed  Fixture 


A  test  fi.xture  (sec  Figure  2.10)  is  proposed  so  that  variations  in  crucial  physical  parameters  can 
be  quantified  and  tested  easily,  for  example:  drive  compliance,  drive  freeplay  (backlash),  barrel 
length,  disturbances,  and  sensors.  Physical  variations  will  be  accomplished  where  possible  by 
interchangeable  and  mechanically  adjustable  elements,  otherwise  the  effects  will  be  emulated  in  the 
real  time  processor. 


Figure  2.10: 


Pointing  Testbed 


By  making  all  of  the  above  functions  readily  adjustable  either  mechanically  or  by  emulation  in 
the  processors,  the  performance  of  different  control  algorithms  b2ised  on  various  sensor  and  resolver 
configurations  can  be  compared. 

2.4.2  Modeling  of  Nonlinear  Flexible  Systems 

Although  the  Duffing  system  (2.15)  used  in  the  simulation  example  is  interesting,  it  does  not  capture 
aU  of  the  essential  characteristics  of  flexible  nonlinear  systems  under  control,  and  moreover,  does 
not  well  represent  the  Adaptive  Testbed.  However,  because  it  has  been  so  extensively  studied, 
we  do  plan  to  use  it  as  a  stepping  stone  to  more  representative  systems.  In  particular,  a  more 


representaive  scenario  is  depicted  below  in  Figure  2.11. 


Figure  2.11:  A  fle.xible  rotating  system  with  backlash  in  the  gear-train. 


This  system  represents  the  case  where  the  actuation  is  applied  to  a  load  through  a  fle.xible  and 
possibly  geared  device.  This  is  a  common  situation  in  mechanical  systems  such  as  robots  or  in 
proposed  active  controllers  for  large  space  structures.  The  gearing  is  shown  to  occur  at  the  end 
of  the  flexible  member,  although  other  combinations  may  be  possible,  this  perhaps  being  the  more 
common. 

Neglecting  any  electronic  dynamics,  and  assuming  that  the  flexible  rod  is  both  uniform  and 
undamped,  the  motion  of  the  system  for  small  angular  deflections  is  well  approximated  by  the 
partial  differential  equation  in  time  t  and  spatial  coordinate  i. 


x)  -  i)  =  0  (2.17) 

with  boundary  conditions 

pzxit,0)  =  (2.18) 

pzxit,^)  =  + 9[a{t)]  (2.19) 

JlK^)  =  Ng[a{t)]  (2.20) 

where  y  is  the  load  output,  u  is  the  user  applied  torque,  and  a  is  the  relative  gear  angle 

a{t)  =  z{t,()  -  Ny{t)  (2.21) 


The  constants  are  defined  as  follows:  Jm,  Jgi  anti  Jl  are  the  motor,  motor  gear,  and  load  inertias, 
respectively,  N  is  the  gear  ratio  which  is  greater  than  one,  p  is  the  tortional  mass  density,  and  p  is 
the  tortional  stiffness.  The  nonlinear  function  g{-)  arises  from  backlash  in  the  gear  train,  and  has 
the  typical  shape  as  shown  in  Figure  2.12. 

The  break-point  parameter  06  relates  to  gear  teeth  spacing  and  the  slopes  in  the  two  regions  relate 
to  gear  teeth  shapes.  Typically  for  |q|  >  06  the  slope  is  very  large  whereas  for  |a|  <  06  the  slope 
is  very  small. 

Observe  that  (2.17)-(2.20)  contains  many  of  the  features  of  a  realistic  flexible  system.  Firstly, 
it  is  a  PDE  and  thus  captures  the  large  number  of  modes  which  characterize  flexible  systems. 
Secondly,  the  nonlinearity  g{-)  is  common  to  pract'cally  all  actuators,  both  in  robotics  and  those 
proposed  for  space  structure  active  control.  Moreover,  this  type  of  nonlinearity  is  also  representative 
of  the  interaction  between  joints  and  flexible  structural  members  in  a  space  structure.  Of  course 
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Figure  2.12:  A  typical  backlash  function. 


there  is  no  reaJ  end  to  posing  even  more  complicated  models,  and  at  some  level  even  (2.17)-(2.20) 
can  be  accused  of  being  too  simple.  The  same  applies  to  the  Duffing  system  (2.15),  but  the  counter 
argument  is  based  on  the  rich  complexity  which  results  from  this  “simple”  model,  e.g.,  Figure  2.2. 
Nonetheless,  we  take  the  position  that  (2.17)-(2.20)  is  our  representaive  system  for  the  purposes  of 
analysis.  As  mentioned  before,  we  still  plan  to  use  the  adaptive  Duffing  system  (2.15)-(2.16)  as  a 
stepping  stone  to  further  our  understanding  of  systems  like  (2.17)-(2.20)  . 

As  an  intermediate  step  we  will  also  make  use  of  the  following  ODE  as  a  one-mode  approximation 
of  (2.17)-(2.20)  which  includes  damping: 


Jm'zi  =  u- D{z2- zi)-\- K{z2- zi)  (2.22) 

Jch  =  “5(®)  (2.23) 

JlV  =  Ng{a)  (2.24) 

a  =  22  -  Ny  (2.25) 

where  21,22  ^.re  defined  by 

2i(t)  =  2(t,0)  Z2{t)  =  z{t,l)  (2.26) 


and  where  the  constants  D  and  K  represent  damping  and  stiflFness,  respectively. 
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Adaptive  Control 
via 

Parameter  Set  Estimation 

Robert  L.  Kosut*^ 


Abstract 

An  adaptive  control  system  is  examined  where  the  traditional  parameter  estimator 
is  replaced  by  a  parameter  set  estimator  which  provides  a  measure  of  uncertainty  of 
the  estimated  parameters.  It  is  shown  how  to  construct  a  parameter  set  estimator  with 
the  property  that  the  true  system  is  always  in  the  model  set,  and  furthermore,  how  to 
design  the  estimation  experiment  so  that  the  set  of  uncertainty  is  as  small  as  possible 
given  some  a  priori  information. 
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1  Introduction 


A  parameter  adaptive  control  system,  as  depicted  in  Figure  1,  consists  of  two  processes 
which  make  the  system  adaptive,  namely:  (1)  a  model  parameter  estimator,  and  (2)  a 
control  design  rule. 


Figure  1:  Generic  Adaptive  Control  System 

The  parameter  estimator  (ID)  operates  on  the  input-output  data  obtained  from  mea¬ 
surements  (y,  u)  of  the  plaint  system  (P)  to  be  controlled,  producing  a  model  parameter 
estimate  6  €  The  parameter  estimate  is  transformed  by  the  control  design  rule  (D) 
into  a  controller  parameter  p  €  IR^,  which  is  then  used  in  a  pre-determined  parametric  con¬ 
troller  structure  (C)  in  feedback  with  the  actual  system.  Questions  regarding  the  behavior 
of  this  system  can  be  answered  by  describing  the  estimator  properties,  in  particular,  the 
’’goodness”  of  the  ’’converged”  model  set. 

From  the  viewpoint  of  designing  a  robust  control,  by  a  good  model  set  is  roughly  meant 
one  for  which  every  member  produces  a  controller,  via  the  control  design  rule,  which  when 
applied  to  the  actual  system  yields  acceptable  closed-  loop  performeince  with  regard  to 
attenuating  the  effect  of  the  disturbance  d  and  tracking  the  reference  r.  Hence,  a  properly 
designed  model  estimator  should  converge  to  a  member  of  that  model  set  which  yields  the 
requsite  closed-loop  performance. 

However,  in  the  adaptive  control  system  of  Figure  1,  no  specific  accuracy  informa¬ 
tion  about  (P)  is  passed  to  the  control  design  rule  (D).  Thus,  we  reach  a  very  important 
observation; 

The  model  estimator  should  be  designed  to  be  compatible  with  the  same  class  of 
uncertainty  anticipated  by  the  robust  control  design  rule. 

The  idea  then,  is  to  replace  the  usual  single  point  parameter  estimator  of  Figure  1  with  a 
parameter  set  estimator  which  provides  a  set  of  peu-zuneter  values,  and  hence,  a  meeisure 
of  parameter  estimation  accuracy.  The  control  design  rule  must  then  also  be  modified  so 
as  to  accept  this  uncertainty  description  and  produce  a  controller  which  is  robust  to  the 
parameter  set. 

The  estimation  of  a  transfer  function  from  measured  data  has,  of  course,  a  long  history, 
and  there  will  be  no  attempt  to  document  that  here.  A  good  source  of  references  as  well 
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as  providing  a  clear  elucidation  of  the  technical  issues  is  the  textbook  by  Ljung(19S7). 
One  means  to  provide  a  measure  of  uncertainty  in  the  transfer  function  estimate,  is  based 
on  utilizing  the  theoretical  asymptotic  staitistics  to  produce  confidence  intervals  from  chi- 
squared  distributions.  Underlying  this  approach,  however,  is  the  assumption  that  the  true 
plant  is  in  the  model  set,  and  hence,  estimation  errors  are  assumed  to  be  dominated  by 
the  variance  contribution  for  a  sufficiently  long  observation  interval.  In  the  case  considered 
here,  the  dominant  contribution  is  the  bias,  which  is  primarily  due  to  unmodelled  dynamics 
and  disturbances.  Moreover,  it  is  important  when  designing  the  estimation  experiment  to 
be  aware  of  the  intended  use  of  the  estimte,  which  in  the  case  considered  here,  is  control 
design. 

Some  recent  work  along  these  lines  may  be  found  in  Wahlberg  and  Ljung(19S6),  Gevers 
and  Ljung(19S6),  Wittenmark(19S7),  and  Hansen(1988).  The  idea  in  these  references  is 
to  manipulate  the  user  design  choices  for  the  identification  experiment,  such  as  data  filters 
and  input  spectra,  so  that  the  criterion  for  estimation  looks  like  the  criterion  for  the 
intended  use  of  the  model  estimate,  e.g.,  control  design  in  this  case.  Another  approach  is 
to  use  high  order  model  sets  and  then  model  reduction  for  control  design,  e.g.,  Wahlberg( 
1986)  uses  high  order  least-squares,  Parker  and  Bitmead(1987)  use  high  order  FIR  models. 
The  appeal  of  these  approaches  is  that  no  parametric  model  need  be  advanced.  In  this 
case  the  transfer  function  estimation  error  consists  of  a  term  which  depends  only  on  the 
true  transfer  function  and  decays  as  model  order  increases,  plus  a  term  which  depends 
on  the  noise- to-  signal  ratio  and  may  increase  as  model  order  increases.  These  terms  can 
be  bounded,  but  the  results  are  asymptotic  giving  only  qualitative  information.  Another 
potential  difficulty  is  the  high  order  which  may  be  too  computationaJij’’  intensive  in  some 
situations.  The  work  of  LaMaire  et  al.  (1987)  overcomes  the  high  order  problem  by  using 
a  low  order  parametric  model  set.  The  procedure  results  in  parameter  estimates  along 
with  estimating  a  bound  on  the  unmodelled  dynamics.  In  some  cases  this  will  also  require 
considerable  computation.  In  Kosut(1986,  1987),  a  middle  road  was  taken  where  least- 
squares  is  used  to  estimate  the  parametric  model  and  a  standard  spectral  estimator  is  used 
on  the  residual  output  error  resulting  in  a  crude  bound  on  the  unmodelled  dynamics  and 
disturbance  spectrum. 

In  this  paper  we  concentrate  on  developing  a  parcimeter  set  estimator  from  the  standard 
least-squares  parameter  estimator  together  with  some  a  prion  information  about  the  the 
plant  system  unmodeled  dynamics  and  disturbances.  Least-squares  parameter  estimation 
is  one  of  the  most  widely  used  methods  for  obtaining  peurEimeter  estimates.  Its  popularity 
arises  undoubtedly  because  of  the  ease  in  finding  a  solution,  i.e.,  the  solution  is  a  unique 
global  minimum  and  has  both  a  closed-form  solution,  sometimes  called  batch-least-squares 
(BLS),  and  a  recusive  solution  method,  or  recursive-least-squares  (RLS).  The  problem  or 
disadvantage  is  that  the  resulting  model  estimate  obtained  via  LS  will  be  ’’biased”  eis  a 
result  of  inevitible  modeling  approximations,  most  notably,  unmodeled  disturbances  and 
dynamics.  But,  the  disadvantage  can  be  alleviated  by  the  advantage.  In  the  approach 
presented  here,  we  exploit  the  known  solution  structure,  and  together  with  some  a  priori 
data,  produce  a  parameter  set  estimator  which  contains  the  true  system.  We  also  show  how 
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to  design  the  estimation  experiment  so  that  this  set  is  as  small  as  possible  with  respect  to 
the  o  priori  information.  Specifically,  we  use  a  transfer  function  model  set  which  consists 
of  a  parametric  part  with  a  known  structure,  and  an  unstructured  part  which  is  known 
a  priori  to  be  bounded  in  the  frequency  domain.  Once  this  is  done,  two  results  follow, 
namely,  (1)  an  upper  bound  on  parameter  bias  can  be  determined  from  the  on-  line  data, 
and  (2)  the  bound  cam  be  made  small  by  appropriate  choice  of  data  filters  and  input  spectra. 
This  approach  follows  quite  closely  to  that  of  Morrison  and  Walker(1988),  the  results  here 
being  applicable  to  more  general  model  sets.  Also,  the  upper  bound  on  paraimeter  error 
obtained  here  is  asymptotically  tight  given  the  a  priori  data,  and  can  be  computed  using 
the  standard  DFT.  A  preliminary  version  of  this  paper  is  in  Kosut(1988). 

Other  important  estimator  requirements,  such  as  the  rate  of  convergence  and  region  of 
attraction,  which  play  a  significant  role  in  estimator  design,  are  not  specifically  addressed. 
For  example,  if  convergence  to  the  model  set  is  slow,  and  the  initial  model  estimate  is 
too  coarse,  then  unacceptable  behavior  may  occur  during  the  learning  process.  Designing 
adaptive  control  systems  with  a  prescribed  rate  of  convergence  or  region  of  attraction  is 
not  a  solved  problem,  although  much  is  understood  in  the  case  of  slow  adaptation,  e.g., 
Anderson  et  al.(1986). 
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2  Parametric  Transfer  Function  Modeling 

2.1  Sampled-Data  Structure 

We  start  with  the  assumption  that  the  true  plant  system  is  accessed  through  a  Scimpled- 
data  structure  where  control  commands  and  data  acquisition  occur  at  simultaneous  discrete 
sampling  instances  separated  by  uniform  intervals.  Thus,  the  true  plant  system  can  be 
described  by  the  discrete-time  relation 

y  =  Gu->r  d  (1) 

where  y{t)  and  u{i)  are  the  measured  output  and  input  sequences,  respectively,  evaluated 
at  the  normalized  sample  times  t  €  {. . . ,  —1,0, 1,2, . . The  sequence  d{t)  represents  the 
effect  of  disturbances  sources  as  seen  at  the  output.  The  operator  G  is  linear  and  time- 
invariant  with  transfer  function  G{q),  where  q  is  the  shift  operator,  i.t.,  (g^i)(t)  =  x{t  +  k) 
for  any  integer  k.  Unless  needed  for  clarification  we  will  suppress  the  ’9’  and ’t’  arguments 
in  transfer  functions  and  sequences,  respectively. 

Note  that  the  input  u  may  be  generated  by  a  feedback  mechanism. 


2.2  Transfer  Function  Parametrization 

The  plant  transfer  function  is  assumed  to  have  the  following  structure: 

^  -Btfo  + 


As.  -f  Aa 


where 


=  I'o  +  K.i^i 

Aso  =  Mo  +  T,i=i(io,xMi  (3) 

&o  —  (^Oil  '  *  '  ^o,n  ^0,1  ■  '  ■  G  IR^,  p  ~  Tl  TTl 

and  Li,  Mi,  Ab,  Aa  are  stable  transfer  functions.  We  refer  to  Oo  €  IR'’  as  the  true  parameier, 

Bs 

c.  =  (4) 

as  the  parametric  transfer  function,  and  to  Ab,  Aa  as  nonparametric  or  unmodeled  dynam¬ 
ics  in  the  numerator  and  denominator,  respectively.  The  following  data  is  assumed  to  be 
available  a  priori  : 


A  Priori  Data; 

(i)  Li,  Mi  are  known  stable  transfer  functions. 

(ii)  Ab,  Aa  ore  uncertain  stable  transfer  functions  satisfying: 

|AB(e^'-)|  <  |WB(e>“)|,  VH  <  TT 
|A*(ei")|  <  |»'a(£^)I,  V|w|  <  I 


4 


where  are  known  stable  transfer  functions. 

(Hi)  The  true  parameter  do  is  in  the  set 


0nom  =  {00  e  W  ;  a™"  <  Oo.,-  <  o”",  <  bo,i  <  6r"}  (6) 

This  model  structure  and  a  priori  assumptions  define  a  family  of  transfer  functions. 
Within  this  family,  or  model  set,  are  many  of  the  common  transfer  function  model  sets 
used  for  robust  control  design.  For  example,  there  is  the  ubiquitous 


bo,iQ  * 

1  +  ao,i5“‘ 


(7) 


where  L  represents  nonparametric  dynamics.  The  choices  of  the  model  elements  in  (2) 
depend  upon  a  priori  information  about  L.  Suppose,  for  example,  that  Z  is  an  a  priori 
estimate  of  L,  and  it  is  known  that  the  relative  error  L  =  L/L  —  1  satisfies 

|£(e'“)l  <  |W'i(e^)|,  V|^l  <  , 


The  corresponding  choices  in  (2)  are  then: 


Bbo  =  i:Ubo,iq-'L 

—  Id* 

Ab  =  LBg^ 

Aa  =  0 


(8) 


The  transfer  functions  {Li,  Mi}  in  (2)  follow  from  the  above  definition  and  are  obviously 
stable.  In  addition,  the  unmodeled  dynamics  are  bounded  by; 


Wa  =  0 

Wb  =  {instable:  sup, 


(9) 


A  similar  configuration  would  follow  if  the  unmodeled  dynamics  involved  the  absolute 
error,  i.e.,  where  L  =  L  —  L. 

Another  common  model  form  arises  from  the  theory  of  stable  factorization,  see,  e.g., 
Vidya.sageir(1985).  In  this  case  Be^  and  represent  stable  co-prime  factors.  For  example. 


m  -i 

Bbo  —  Xy 

«=i  ^ 

1  "  5“’ 

At,  =  ^  + 


(10) 


where  D  is  a  polynomial  in  5"^  such  that  l/D  is  stable.  In  this  case,  Ab  and  Aa  are 
unmodeled  dynamics  in  the  stable  factors. 

In  many  instamces  the  plant  transfer  function  is  bilinear  in  a  particular  physical  pa¬ 
rameter.  For  example,  a  mass  at  the  tip  of  a  flexible  shaift  will  enter  linearly  in  both  the 
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numerator  and  denominator  of  the  transfer  function  from  the  tip  position  to  the  torque 
at  the  shaft  hub,  see,  e.g.,  Rovner  and  Franklin(1987).  In  general  there  are  always  fewer 
’’physical”  parameters  than  ’’canonical”  parameters  such  as  transfer  function  coefficients. 
A  general  bilinear  form  for  a  physical  transfer  function  parametrization  is  (2)  with 

Be.  =  Lo  +  ZLi^o,iLi 

=  M)  +  ULl  (11) 

Bo  =  (^0.1 

where  now  6o  €  IR^  is  the  true  physical  parameter  vector.  Observe  that  (11)  allows  for 
common  parameters  in  the  numerator  and  denominator  of  the  parametric  model,  and 
hence,  is  a  more  general  form  than  (3). 

2.3  Caveat  Emptor 

The  plant  parametrization  (2)  is  limited  by  the  ’’locations”  of  the  unmodeled  dynamics, 
Ab,  Aa,  which  represent  relative  errors  in  both  the  numerator  and  denominator,  respec¬ 
tively.  If  more  is  known  about  the  source  of  the  unmodeled  dynamics,  then  there  is 
a  tendency  towards  conservatism  which  limits  the  control  design  performance,  see,  e.g., 
Doyle  et  al.  (1982).  Other  more  sophisticated  structures  for  the  puposes  of  transfer  func¬ 
tion  parameter  estimation  can  be  advanced  [e.g.,  Krause  and  Khargonnekar(1987)],  but 
these  will  not  be  pursued  here,  since  for  many  cases  our  proposed  structure  suffices. 
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3  Least- Squares  Parameter  Estimation 

3.1  Batch-Least-Squares 

The  batch-least-squares  parameter  estimator  transforms  the  measured  data  {y,u  :  t  Q 
[0,  iV  —  1]}  into  the  parameter  estimate  6j;f  E  by  solving: 

(12) 

where  ee  is  a  parametric  error  sequence  generated  from  the  meeisured  data,  and  (■)^  denotes 
averaging  over  the  TV-point  observation  interval,  i.e., 

i^)N  =  E  (13) 

t=o 


The  parametric  structure  of  eg  is  motivated  by  the  model  structure  of  (2).  Thus,  we 
form  the  (filtered)  parametric  equation  error 


£0  =  Wp{A0y  —  Bgu) 


(14) 


where  Wp  is  a  stable  data  filter,  and  corresponding  to  the  form  of  (2)  and  (3), 

Be  =  Lo  +  YaLi 

Ae  =  +  (15) 

e  = 

The  error  can  be  written  as  an  aiEne  function  of  6,  that  is, 

£g  =  z-  e'^4>  (16) 


with 

^  =  Wp  (Moj/  -  Lou) 

(f)  =  Wf  {~Miy  •  •  •  MnV  Liu  •  •  •  Xmw)^ 


(17) 


We  refer  to  ^  E  B.’’  as  the  regressor.  Clearly,  both  z  E  H.  and  (f>  E  IR'*  axe  obtained  by 
filtering  the  measured  data.  Observe  that  in  the  more  general  paraimetrization  of  (11), 


z  =  Wp(M<,y-i„«) 

<f>  =  Wp(Tiu -Miy-'-XpU  -  Mpj/)^ 


(18) 


With  the  above  definitions  of  z  and  the  familiar  solution  to  (12)  is 

provided  the  indicated  inverse  exist.  The  following  result  is  now  easily  obtained: 


(19) 
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Lemma  1  (Linear  Regression  Model  and  Parameter  Error)  The  true  system  is  equiv¬ 
alent  to  the  linear  regression  model, 

2  =  (-0) 

where 

=  fFp  (-Aa^  +  Ab«)  (21) 

£d  =  P^F(A9.  +  AA)d  (22) 

In  addition,  the  parameter  error  can  be  decomposed  as  follows: 

^  (23) 

where  6^  is  the  error  contribution  due  to  nonparametric  dynamics  and  is  the  error 
contribution  from  the  disturbance.  Specifically, 

6%  =  (24) 

(25) 

The  expression  (20)  can  be  thought  oi  as  a  "canonical  linear  regression  model”  with  a 
parameter  Bg  to  be  estimated  from  "measurements”  {z,  ^  :  t  6  [0,  iV  —  1]}.  The  sequences 
represent  the  effect  of  the  nonparametric  dynamics  and  disturbances,  respectively. 
That  is,  behavior  of  z  which  can  not  be  accounted  for  by  the  peirametric  part  <i>. 

The  parameter  error  expressions  in  Lemma  1  generedize  those  fo\md  in  Morrison  and 
Wedker(198S).  Observe  that  in  the  expression  for  the  right  hand  side  is  a  function 
of  the  regressor  sequence  {^},  which  is  obtained  from  measurements,  and  the  unknowns, 
Ab,  Aa,  and  In  the  expression  for  Bff  the  only  unknown  in  the  right  side  is  the  error 
sequence  {£<i}.  Thus,  any  attempt  to  provide  on-line  boimds  for  the  parameter  error  will 
involve  a  priori  data  as  well  as  measured  data. 


3.2  Recursive- Least- Squares 


The  recursive-least-squares  (RLS)  algorithm  is  [see,  e.g.,  Lj\mg(1987)]: 

£{t)  =  z{t)  - 

^RLS  ^  (26) 

R{t)  =  R{t~l)  +  <l>{t)<f>{tf 

The  algorithm  starts  at  f  =  1  with  initial  values  B^  and  Ro  =  Rj  >  0.  For  t  >  1  the  RLS 
estimate  is  explicitly  given  by 


^RLS 


(27) 
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Assuming  that  exists  for  all  t  >  t<,  >  1,  then  for  sufficiently  large  t  >  to  and/or 

small  Ro,  the  RLS  estimate  is  approximately  the  BLS  estimate  for  a  data  record  of  length 

(28) 

A  principal  adavantage  of  RLS  over  BLS  is  that  memory  requirements  can  be  vastly  dif¬ 
ferent  especially  for  very  long  data  records.  BLS  stores  records  of  length  O  (t),  whereas 
RLS  stores  only  O  (p).  Since  the  RLS  estimate  will  approach  the  BLS  estimate,  RLS  is 
usually  the  method  of  preference  even  if  the  computations  cire  to  be  done  off-line,  e.g., 
RLS  is  used  repeatedly  over  the  data  set. 


3.3  Parameter  Set  Estimation 

The  problem  of  parameter  set  estimation  is  to  transform  the  measured  data  {y,u  :  t  E 
[0,N  —  1]}  plus  any  a  priori  data  into  the  parameter  set  estimate 

0  =  {^  e  IR'’ :  !|^  -  ^11  <  (29) 

where  ||  •  H  is  a  vector  norm  on  IR^,  ^  is  a  nominal  estimate,  and  6  specifies  its  uncertainty 
in  IR'’.  Conceptually,  the  set  estimator  is  the  map 

r  {!,,u:<e(0,JV-ll)  1 

•  +  I  ►->  §  (30) 

^  a  priori  data  J 

In  order  for  0  to  be  useful  for  control  design  we  &st  require  that  the  true  parameter  6o  is 
in  this  set,  that  is, 

00  €  0  (31) 

Secondly,  the  set  should  be  sufficiently  smadl  so  as  to  insure  a  robust  control  design  which 
also  satisfies  some  performance  specifications.  Hence,  a  second  constraint  is 

(32) 

where  tfspec  is  obtained  from  closed-loop  design  specifications.  Without  the  design  con¬ 
straint  the  estimation  problem  has  no  practical  meaming,  because  then  typically  the  a  pri¬ 
ori  set  is  sufficient  for  design;  the  result  being  a  stable,  but  low  performance  closed-loop 
system. 
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4  Parameter  Error  From  Unmodeled  Dynamics 


Using  Lemma  1,  we  will  establish  a  bound  on  |(^^1|  which  depends  on  the  meeisured  and 
a  -priori  data.  The  norm  used  is  the  ^oo-norm  on  that  is,  ||x||  =  max{|x,|  :  i  6  [l,p]}, 
which  is  consistent  with  independent  model  parameters  as  in  (6).  One  can  rationalize  other 
choices  cis  well.  For  example,  the  ^2-norm,  ||xl|  =  implicitly  imposes  a  relation 

amongst  the  parameters  which  may  be  sensible  in  some  problem  context. 

In  the  results  to  follow  we  use  ||  •  ||  to  zdso  denote  the  matrix  norm  induced  by  the 
^oo-norm,  that  is,  for  any  matrix  A  with  (possibly  complex)  elements  a,j, 

|1A||  sup  jlAi||  =sup]^|ay|  (33) 

lkll=i  i 

We  also  use  ’abs  [.4]’  to  denote  the  matrix  whose  elements  are  the  absolute  values  of  each 
element  of  the  matrix  A,  thus,  ||A|j  =  ||abs  [A]  |(.  The  Discrete- Fourier  Transform,  or  DFT, 
of  a  sequence  {/  :  i  €  [0,  N  —  1]}  is  defined  here  as 

1  N—l  ‘yirlc 

FnM  =  DFT(/}  =  /(<)=■’"'.  «  €  An  =  {-j^  ;  J:  €  [0,Ar  -  1]) 

where  are  the  normalized  DFT  frequencies. 

Theorem  1  (Bound  on  Parameter  Error)  Lei  {<&/^(a;),  y;v(w),U;v(‘*^)  :  w  6  flAr}  de¬ 
note  the  DFT’s  of  the  N -point  sequences  {<f>,y,u  :  t  €  [0,N  —  1]},  respectively,  -with  DFT 
frequencies  fi/v-  Lei  WBF(t)  and  WAF{t)  denote  the  pulse  response  of  WpAs  and.  WfAa, 
respectively.  Suppose  there  are  stable  transfer  functions  Ws  and  W^,  and  constants  M  >  0 
and  p  €  (0, 1),  such  that: 


lAB(e^-)|  <  lWB(eni,  VH  <  TT 

(34) 

lAA(e^<^)|  <  |WA(e^")l,  Vla;|  <  TT 

(35) 

max{|u;BF(t)|,  ^afCOI}  ^  Vt  >  0 

(36) 

Then, 

119^11  <  I|7aI|  +  IItbII  +  § 

(37) 

where  7a 

,  7b  €  R'’  are  given  by 

7a  =  X:  abs[wF(e^'“)PFA(e^'")lV(u;)(9^.^'');;'^N(u;) 

(38) 

7b  =  E  abs[wF(e^‘-)PFB(e^“)C^A,(u;)(#^)''$yv(w^ 

(39) 

u-en^v  *■ 

and  ^  is 

the  constant 

“ w  -  ^ ) ")  (T-  9)'  0 

with  :  t  G  (—00,00)}  the  N -periodic  extension  0/ {y,  tt  :  <  6  [0,  iV  —  1]}. 


Proof:  See  Appendix. 


Remarks:  (1)  The  iV-periodic  extension  of  the  iV-point  sequences  y,u  are  obtained  by 
copying  t/,u  for  all  iV-point  intervals  in  (—00,00). 

(2)  In  the  special  case  when  y,  u  are  A'’-periodic,  /5  =  0,  and  the  bound  (37)  on  the 
parameter  error  is  tight,  that  is, 

sup  11^^11  =  IItaII  +  IItbII  (41) 

Ab  ,Aa 

Thus,  for  large  values  of  N ,  the  bound  is  asymptotically  tight. 

(3)  The  constant  p  G  (0, 1)  is  the  magnitude  of  the  least  stable  pole  of  either  TVVAb  or 
VVp  Aa  • 


4.1  Computing  The  Bound 

To  compute  the  bound  in  (37)  requires  both  measured  and  a  priori  data.  The  first  two 
terms  in  the  bound  involve  the  vectors  7b, 7a-  These  depend  on  the  measured  iV-point 
data  {<f),y,u  :  t  6  [0,  N  —  1]}  and  on  a  priori  knowledge  of  the  model  weighting  functions 
Wb  and  Wx-  The  measured  data  cam  be  used  to  compute  the  required  DFT’s  as  well  as 

The  last  term  in  (37)  decays  as  1/iV,  and  hence,  can  be  neglected  for  a  sufficiently  long 
data  record.  If  not  neglected,  then  its  computation  depends  on  a  number  /?.  To  compute 
/?,  observe  that  the  first  and  second  factors  in  depend  on  meaisured  data.  The  first  uses 
current  data,  that  is,  data  for  t  6  [0,iV—  1],  However,  the  second  factor  needs  information 
from  Biblical  Times  (t  =  —00)  to  just  before  the  current  data  record  (t  =  0).  Since  this 
second  factor  is  essentially  a  measure  of  the  ne^l^ness  to  7'7-periodicity  of  y,u,  it  is  likely 
that  a  reasonable  upper  bound  can,  be  found  from  aneilysis  or  experiments.  The  third  factor 
depends  on  a  priori  knowledge  of  the  pulse  response  of  the  nonparametric  error  dynamics. 


4.2  Design  Choices 


The  designer  can  affect  the  size  of  the  bound  by  direct  choice  of  the  data  filter  IFf,  and 
indirectly,  by  manipulating  some  exogenous  probing  signals.  Both  of  these  choices  will 
effect  7b,7a,  and  0. 

Selection  of  the  data  filter  Wp  is  based  on  the  underlying  assumption  that  the  para¬ 
metric  model  is  a  good  approximation  of  G  over  some  range  of  frequencies  fip-  Hence, 
a  natural  choice  for  the  data  filter  is  one  that  has  the  property  that 


|k7F(e>-)| 


l-fO(77),  wGfip 
O  (rj) ,  otherwise 


(42) 


where  7  is  a  small  positive  number  which  can  be  interpreted  as  passband  or  stopband 
’’ripple”.  As  a  result  we  have: 
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Corllary  1  If  satisfies  (42),  then  for  all  small  tj  >  0, 


<  { sup  I|7aII  +  (sup  HFB(e''“')|')  UtbH  +0{,)  +  ^ 

\uj^Qp  )  ywCOp  f  ^ 

where  P  is  from  Theorem  1  and  7a)7b  ^  are  given  by: 

7a  =  L  abs  [yiv(a;)  «>,v(a;) 

ueOp  '■ 

7b  S  abs  [i7N(w)  ^n(w) 


(43) 


(44) 

(45) 


For  large  N  and  small  tj,  the  bound  in  (43)  is  proportional  to  the  peak  magnitude  in 
the  frequency  domain  of  the  nonparametric  dynamics  in  the  passband  Vtp.  This  is  unlike 
robust  control  where  a  bound  is  needed  over  all  frequencies.  Thus,  the  data  filter  can 
be  used  to  offset  the  impact  of  the  poorly  modeled  nonparametric  dynamics.  A  typical 
example  is 

fip  =  ;  |ut|  e  [uTa.tATi]}  (46) 


The  bound  in  (43)  is  also  proportional  to  a  sum  of  vector  norms  (||7aI|  +  !|7bII)  which 
can  be  related  to  the  condition  number  oi  the  matrix  Recall  that 


N 


cond  =  II  II  II 

To  see  the  relation,  observe  that 

IItaII  <  ll{WOL'llll  E  ubs(iV(u;)M")lll 


T\  \  f  II  Ilwgn;,  abs  [r;v(a;)$;v(u>)] 

II  II 


(47) 


Clearly  the  second  factor  above  is  near  unity  because  (f>  consists  of  filtered  versions  of  y,  u. 
Thus,  data  which  results  in  a  large  condition  number  is  just  as  bad  as  leirge  nonparametric 
model  error  in  the  passband. 

Observe  that  the  smallest  value  of  the  condition  number  is  one,  and  minimization  of 
it  is  a  long  standing  difficult  problem,  especieJIy  in  closed-loop  where  depends  in 

a  complicated  manner  on  the  true  plant  dynamics  and  all  the  exogenous  inputs,  see,  e.g., 
Mareels  et  al.  (1986). 
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5  Application  to  a  Special  Case 


To  provide  a  clearer  interpretation  of  Theorem  1  and  Corollary  1,  we 
to  the  model  structure  of  (7),  that  is,  where 

specialize  the  results 

G  L 

1  +  IT=i  ao,iq-' 

(48) 

L  =  [\  +  L]L 

(49) 

and  where  L  represents  uncertain  stable  nonpaxametric  dynamics  known  to  be  bounded 

by 

1^(6^^^)!  <  \WUen\,  Vlu;|  <  TT 

(50) 

We  then  have: 

TA 

Beo  = 

t=l 

(51) 

n 

t=i 

(52) 

Ab  =  'L 

(53) 

* 

1 

o 

II 

< 

<3 

(54) 

The  least-squares  parameter  error  due  to  the  unmodeled  dynamics 
this  case; 

and  distrbance  is  in 

(55) 

(56) 

m 

Ca  =  LY^ho,iq~'Lu 

t  — 1 

(57) 

•  —  1 

£d  =  WpAe^d 

(58) 

<f>  =  WpL{-q~^y - q~^y 

(59) 

The  above  definitions  lead  to  the  more  concise  expression: 

-  =  ( t" ) 

(60) 

=  (V.-V™) 

Thus, 

•  /  \ 

(61) 

As  a  result,  following  Theorem  1  and  Corollary  1,  we  can  state; 
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Theorem  2 
where  T  £  is: 


r^ll<l|r||  \\K\\  +  Oil/N) 


r=  X!  abs 


/n  addition,  if  Wp  satisfies  (4^),  then 


nxm 

m 


(62) 


(63) 


<  [jup  m(en\\  liril  1|6,1|  +  0(,)  +  0(l/iV) 


where 


r  =  ^  abs 
c«/€f^F 


0„ 


An  example  of  this  model  structure  is  presented  next. 


Xm 


(64) 


(65) 


5.1  Example:  Rotating  Flexible  Servo 

The  example  system  is  a  rotating  flexible  servo  described  in  Astromand  Wittenmark(198S). 
It  consists  of  a  load  connected  by  a  flexible  shaft  to  a  motor/tachometer.  The  servo 
design  objective  is  to  control  the  motor  velocity  despite  changing  loads  and  uncertain  shaft 
flexibilty.  It  is  assumed  that  the  flexible  shaft  is  adequately  modeled  by  its  first  flexible 
mode  whose  frequency  is  well  below  the  bandwidth  of  the  motor/tach  electronics.  Thus, 
a  good  continuous-time  transfer  function  model  from  motor  control  voltage  to  motor/tach 
velocity  output  is^ 

.  ^ _ Ke{Jls'^  +  Dss  +  Ks) _ 

+  {Jl  +  Jm)Dss  +  {Jl  +  Jm)Ks] 

Jl  is  the  load  inertia,  Jm  is  the  motor/tach  inertia,  Ks  is  the  shaft  stiffness,  Ds  is  the 
shaft  damping,  and  ivs  is  the  product  of  the  motor  and  tach  electronic  gains. 

5.1.1  Parameter  Set  Estimation 

The  parametric  model  used  for  parameter  set  estimation  is  obtained  by  neglecting  the 
shaft  flexibility,  thus, 

Voce'S)  =  —  (67) 

s 

^Beware  that  we  use  the  same  nomenclature  to  denote  continuous- time  and  discrete-time  transfer  func¬ 
tions,  t.g.,  G{s)  and  G{q).  These  ate  not  to  be  read  as  the  same  function  of  a  different  variable  s  and  q.  In 

this  case,  G{q)  ZOTi  {G(s)}  where  ZOTt  {•}  is  the  usual  zero-order- hold  transformation. 
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with  true  paxameter 


The  effect  of  the  changing  load  appears  as  a  change  in  the  parameter  Bo-  The  discrete-time 
parametric  transfer  function  is  then 


GaXq)  =  zon 


ffol  _  Ophq  ^ 

Ui  1-9-^ 


where  h  is  the  sampling  interval  and  ZOH  {•}  is  the  usual  zero-order-hold  transformation. 
By  neglecting  the  flexible  modes,  we  have  implicitly  made  the  selection 


L  =  1 


Hence, 


Be,  =  ^09'"  (71) 

=  ^(1-9“')  (72) 

Aa  =  0  (73) 

Ab  =  (74) 

where  the  nonparametric  transfer  function  is 

~  _  20-H{G(s)-GM} 
zon{GM} 

=  (76) 

Since  the  denominator  delay  will  be  cancelled  by  a  numerator  delay,  L  is  stable  [see  any 

table  of  {•}•  transforms].  Also,  L(l)  =  0,  thereby  reinforcing  the  assumption  that 
Gff^  is  a  good  low  frequency  model. 

The  parametric  equation  error  (16)  is  now 

€9  —  i  —6(f> 


From  Lemma  1  we  have. 


z  =  WF^(l-9“^)y 

^  =  WFq~^u 


z  —  do<l>  -f 
=  6oB<i> 

e,  =  W^F^(l-9-')d 


Simulation  Results 

Simulations  were  performed  using  the  following  values  from  Astromeind  Wittenm8u-k(1988): 

Jl  G  [.0002,  .002] 

Jm  =  -002 

Ke  =  .5  (84) 

Ks  =100. 

Ds  =.0001 

The  frequency  and  damping  ratio  of  the  flexible  mode,  as  well  as  the  true  parameter  Bo 
vary  with  Jl  6  [.0002,  .002]  8is  follows: 

frequency  =  ^  [118,50]  (hz) 

damping  =  t  ^  {1-582  x  10-^  1.581  x  lO"**] 

2  V 

Bo  €  [125,227.27] 
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Suppose  the  saimpling  interval  is  h  =  .002  sec,  thus,  the  sampling  frequency  is  500  hz. 
Figure  2  shows  a  plot  of  |Z(e^‘^)|,  w  =  2nhf,  vs.  /  in  hz  for  varying  Jl  €  [.0002,  .002]. 

Suppose  that  the  true  but  unknown  load  is  Jl  —  .002.  Thus,  the  unknown  parameter 
is  6o  =  125  which  is  to  be  estimated.  Since  the  plant  is  unstable  we  use  the  stabilizing, 
but  low  performance,  control 

u  =  1.5(r  -  y)  (85) 

where  r{t)  is  a  user  applied  reference  command. 


Single  Tone  Reference  In  this  experiment  the  reference  is  the  single  tone 

r{t)  =  sin(u;,t),  tu,  =  27r/i/r 

and  the  data  filter  is 


^Vf  =  1 


The  parameter  error  bound  is  then 


Simulations  were  performed  with  the  system  initizJly  at  rest  with  input  tones  at 

fr  =  [5,10,20,40]  hz 

At  each  tone  a  BLS  parameter  estimate  weis  obtained  for  data  records  of  length 

N  =  [64,128,256,512,1024] 


(86) 

(87) 


(88) 


The  results  are  shown  in  Figures  3-5.  Figures  3  and  4  show,  respectively,  the  true  normal¬ 
ized  parameter  error  \Q%l0o\  3Jid  the  parameter  error  estimate  6  from  (88)  superimposed  (in 
x’s)  on  the  graph  of  |Z(e^")|  at  each  reference  frequency  Wr  for  vEurying  data  record  length 
N .  Observe  that  as  N  increases,  both  the  error  eind  the  bound  approach  the  as3aiiptotic 
upper  bound  |Z(€^"’')|  corresponding  to  the  input  tone.  This  is  predicted  by  the  theory. 
Figure  5  displays  the  same  data  as  a  function  of  iV,  i.e.,  the  bound  S,  and  the 

limiting  value  |Z(e''^'’)|  which  does  not  depend  on  N.  All  the  values  are  converging  to  this 
limit  as  N  increases.  A  summary  of  the  plotted  data  is  also  contained  in  Table  1. 
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150 


Figure  2:  |X(e-^“)|  vs.  u)  for  varying  Jl  =  .002  (daxk  line),  Jl  =  .00063  (dashed  line),  and 
=  .0002  (dotted  line). 


Figure  3;  The  x’s  axe  the  true  normalized  parameter  error  \B%l9o\  superimposed  on  the 
graph  of  |Z(e-'")|  vs.  u;  (daxk  line)  at  each  reference  frequency  w,  for  varying  data  record 
length  N. 
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Figure  4:  The  x ’s  axe  the  values  of  the  parameter  error  estimate  S  from  (S8)  superimposed 
on  the  graph  of  vs.  w  (dark  line)  at  each  reference  frequency  Ur  for  varying  data 

record  length  N. 


0  300  600  900  1200  0  300  600  900  1200 

N  N 


Figure  5;  Plotted  vs.  data  length  N  are:  normalized  parameter  error  \6%J0o\  (dark  line), 
parameter  error  estimate  8  (dotted  line),  and  (Z(e^"’’)|  (dashed  line). 
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Square- Wave  Reference  We  repeat  the  experiment  with  a  unit  amplitude  square- 
wave  refrence  whose  base  period  is  256ft  =  .512  sec  (1.953  hz).  The  results  are  shown  in 
Figures  6-9.  Figure  6  shows  the  time  histories  of  the  plant  input  and  output  for  1024  data 
points,  i.e.,  1023ft  =  2.046  sec.  The  data  filter  is  an  Sth  order  Butterworth  low-peiss  filter 
with  cutoff  frequency  wp-  The  experiments  correspond  to  the  following  values  of  cutoff 
frequency: 

wp  =  27rft/p,  /p  =  [5, 10,20,40]  hz 
The  filter  then  behaves  like  (42)  with  the  passband 

fip  =  {a;  6  (S9) 

Figures  7  and  8  show,  respectively,  the  true  normalized  parameter  error  and  the 

bound  ^1  from  (82)  (the  x’s)  superimposed  on  the  graph  of  l£(e''“)|  at  each  filter  cutoff 
frequency  wp  for  varying  data  record  length  N. 

Observe  that  as  N  increases,  both  the  true  error  and  the  estimate  approach  the  asymp¬ 
totic  limit  |Z(e''‘^)|  corresponding  to  the  filter  cutoff  frequency.  This  is  predicted  by  the 
theory.  Figure  9  displays  the  same  data  as  a  function  of  JV,  :.e.,  \9^l9o\,  the  botmd  <5,  eind 
the  constant  limiting  value  |Z(e-'‘^)|.  In  this  case  all  the  values  zire  below  or  near  the  limit 
except  for  one  instance  at  wp  =  5hz  and  a  very  short  data  record  of  JV  =  64.  A  summary 
of  all  the  square-wave  data  is  contained  in  Table  2. 


Figure  6:  Time  histories  of  (y,  r)  (upper)  and  (it,r)  (lower). 
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0  300  600  900  1200  0  300  600  900  1200 

N  N 


Figure  9:  Plotted  vs.  data  length  N  axe:  normalized  pajzuneter  error  l^^/^ol  (dark  line), 
parameter  error  estimate  S  (dotted  line),  and  \L{e^'^)\  (dashed  line). 
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Table  2;  Summary  of  simulation  results  with  squaure  wave  reference 
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5.1.2  Set-Point  Regulator  Design 

The  problem  is  to  design  a  set  point  regulator  for  the  true  plant 

G  +  L),  Gg^  =  “  ^ 

1  -  g  1 

where  9o  and  L  are  unknown  elements  of  the  sets; 


$0  6  [125,227.27] 

\L(en\  <  VH  <  TT 

where  Wl  is  known.  We  will  take 

~  ^  lji=.002 


which  is  the  worst  case  as  seen  in  Figure  2.  The  regulator  is  based  on  the  parameter  set 
estimator  of  where 

\9-6,\<S\6,\ 

with  6  from  (SO)  and  <5  given  by  either  (82)  or  (83).  The  control  sequence  is  obtained  via 
the  feedback  system 

u  =  A'^r  -  y) 

where  Kp  the  regulator  transfer  function,  is  selected  to  satisfy 

1  +  * 

with  T,  the  desired  closed-loop  trnasfer  function  from  r  to  y.  Observe  that  T,  does  not 
depend  on  the  parameter  estimate  6.  Applying  this  control  to  the  actual  plant  results  in 
the  closed-loop  transfer  function 

r  -  r  (l  +  ^)(l  +  -^) 

l  +  T.M 


where 


M  =  {1  +  G){1  +  L)-1 


The  transfer  function  G  reflects 
to  the  constant 


the  parameter  estimation  error,  and  in  this  example  reduces 


Thus, 


M  =  (l  + 1)^-1  = 
6 


L-efi/e, 

i  +  e^/Oo 
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where  9^  is  the  paxameter  error  induced  by  the  unmodeled  dynamics  L.  Since  G,  L,  and 
T.  are  all  stable,  it  follows  that  a  sufficient  condition  for  T  to  be  stable  is 


|r.(e-’“)M(e^‘*')|  <  1,  V|a;l  <  tt  (90) 

To  provide  a  numerical  example,  consider  the  experiment  with  a  square-wave  reference 
and  a  low  pass  data  filter  with  /p  =  20  hz.  With  N  =  1024,  we  have  \9^l9o\  =  .0603, 
(5i  =  .0605,  and  82  =  .1825  [see  Table  2].  Figure  10  shows  lM(e’“)|  vs.  u  for  normalized 
parameter  errors  of  6^/60  =  ±.0603,  ±.1825.  The  trouble  spot  is  clearly  at  50  hz  where 
the  error  is  about  125  db.  Suppose  that  T,  is  chosen  as  a  4th  order  lOhz  low  pass  filter. 
Figure  ??  shows  that  the  closed-loop  system  is  stable  since  the  above  inequality  is  satisfied. 


Figure  10:  |M(e-'^)l  vs.  u  for  normalized  parameter  errors  of  9^/9o  =  ±.0603  (dark  line), 
-.0603  (dotted  line),  ±.1825  (dashed  line),  -.1825  (deish-dotted  line). 


Further  restrictions  are  required  to  satisfy  performzince  and  not  just  stability.  A  con¬ 
venient  expression  which  reflects  closed-loop  performimce  is  the  relative  error  between  the 
true  closed-loop  T  and  the  desired  T,,  that  is: 

f  jrf  r  ^  (1  -  T.)M 

T.  1  +  T.M 

Suppose  that  the  closed-loop  performance  tolerance  is 

|f(e'")|  <  «(»)),  Vlul  <  TT 


where  e(u;)  is  a  specified  function  of  frequency.  A  typiczJ  example  is 


s(uj)  =  I 

[  Chigh,  M  > 


The  frequency  Wc  represents  the  closed-loop  performance  bandwidth,  e.g.,  T.(e-'")  ss  1,  |u,’|  < 
ujc-  A  sufficient  condition  to  insure  the  performance  constraint  is  that 


def 


e(u) 


sup  |A/(e^"')|  <  i(uj)  =  ,  ,  .  , 

which  also  insures  that  |T,(e^‘^)iV/(e-’‘^)l  <  1,  thus,  T  is  stable.  Using  in  place  of  the  left 
hand  side  above  an  upper  bound  results  in 


1~S 


<%) 


Finally,  an  upper  bound  on  parameter  error,  sufficient  to  insure  performance  as  defined 
above,  is 

Observe  that  for  consisitancy  we  need 


|T(e^'^)|  <  %) 

which  also  enforces  the  requirement  that  5  <  1.  To  see  how  to  use  these  relations  for 
design,  let  ciow  =  -1,  Chigh  =  10,000.  Figure  12  shows  a  plot  of  |L(e^")|  and  i{uj)  vs.  u. 
Thus,  performance  will  be  satisfied  if 

\e^/eo\  <6m  =  .0164 

The  minimum  occurs  at  10  hz.  Since  this  value  is  smaller  than  any  of  the  parameter 
error  bounds  (including  the  actual  error),  we  axe  led  to  lowering  the  data  filter  bandwidth. 
Reducing  from  20  hz  to  10  hz  results  in  \0^l9o\  =  -0144,  6\  —  .0146,  and  62  =  .0420 
[see  Table  2].  The  measured  bound  is  smellier  than  Sm,  and  hence,  the  closed-  loop 
performance  will  be  satisfactory  under  these  conditions.  Now,  the  actual  bound  at  any 
one  time  could  be  different,  but  the  robust  control  design  procedure  can  have  an  option  of 
relaxing  the  performance  criterion. 
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FREQUENCY  (HZ) 


Figure  11:  Plotted  against  u  axe  |r,(e''")M(e'^‘*')|  for  the  parameter  error  values  in  Fig¬ 
ure  10,  and  T.(e-'^)(. 

It2 


1  10  100 
FREQUENCY  (HZ) 


Figure  12:  \L{t^^)\  (dashed  line)  and  i{uj)  (deirk  line)  vs.  u>. 


6  Bound  on  Parameter  Error  from  Disturbance 

To  bound  \\0fj\\  will  require  some  a  priori  data  about  the  disturbance  sequence  Consider 
the  special  case  of  Section  5,  with  (58); 

Ed  =  WpAe^d 

The  following  result  is  obtained  if  Sd  has  a  known  RMS  bound. 


Theorem  3  (RMS  Bounded  Disturbance)  Suppose  that 


<  CTd 


Then: 

ii«iii  <  (ii  {h% 


(91) 

(92) 


Proof:  Direct  application  of  Cauchy- Schwartz  and  tricuigle  inequcdities. 

Use  of  this  bound  requires  prior  knowledge  of  Cd-  Rather  than  producing  this  value 
analytically,  since  €d  depends  only  on  the  true  plant  and  disturbance,  it  is  easier  to  run 
some  simulations  or  perform  some  experiments. 

The  effect  of  the  disturbance  on  the  parameter  error  can  be  neglected  whenever  \\9ff  1|  <  < 
From  Lemma  1,  this  occurs  if  <<  (e^)^.  Using  the  special  structure  in  (60) 

gives: 


< 

< 


where  the  last  line  follows  from  Lemma-DFT  in  the  Appendix.  Now  if  Wl  satisfies  (42), 
then  for  large  N  and  small  77, 

(4)„  <  lll'^ll  (sup  |W,.(e'“)|j  +  O  (i/n/1v)  +  O (,)  (93) 

Thus,  the  unmodeled  dynamics  will  be  the  dominant  contributor  to  parameter  error  if 


N 


«  116.11  sup  |irL(e'")| 


(94) 


An  interpretation  is  that  the  ’’noise  to  signal  ratio”  (the  left  hand  side  above)  should  be 
sufficiently  small  as  determined  by  the  ’’model  error”  in  the  passband  (the  right  hand  side 
above). 
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If  the  noise  is  known  to  be  stochastic,  then  the  above  results  tend  to  be  conserva¬ 
tive,  since  information  is  neglected.  For  example,  suppose  it  has  been  determined  (by 
simulations  and  experiments)  that  £d  is  a  zero-mean  sequence  with  auto-spectrum 

S.M  =  (95) 

where  if  is  a  stable  and  stably  invertible  spectral  factor  of  the  standard  form  H{q)  = 
1  -f  [see,  e.g.,  Ljung(1987)].  Hence,  ej  is  the  output  of  the  filter  H  whose 

input  is  a  zero-mean  white  noise  sequence  e  with  variance  Although  H  and  are 
unknown,  suppose  they  are  known  to  be  bounded  as  follows: 

CTe  < 

<  |^(e^“)|,  V|a;|  <  TT 


With  this  information  available,  it  is  more  appropriate  to  use  the  equation  error 


£9  =  H  ^W^(^Agy  —  Bgu) 

(96) 

This  leads  to  the  parameter  error 

%  = 

(97) 

e'd  =  H-^WpHe 

(98) 

Hence,  in  the  passband  of  the  data  filter  Cj  is  effectively  zero-mezm  white  noise  with 
maximum  variance  Taking  expectations  with  respect  to  the  statistics  of  e,  it  follows 
that 

« ”1  (99) 

Using  the  above  a  priori  statisticzd  data  motivates  the  bound: 

(100) 

This  is  clearly  a  tighter  bound  than  that  obtained  in  Theorem  3,  which  is  using  much  less 
statistical  data  about  the  disturbance  sequence. 
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7  Discussion 


The  parameter  set  estimator  developed  here  can  be  used  either  off-line  or  on-  line.  Most  of 
the  discussion  up  to  now  focuses  on  one  iV-point  fixed  observation  interval  t  6  [0,  A’’  —  1}. 
In  practice,  the  scheme  would  be  implemented  more  or  less  as  follows.  First  solve  the 
least-squares  parameter  estimation  problem  for  a  moving  A-point  observation  interval, 
say,  [#  —  A  -f- 1,  f].  At  certain  times  {t,},  the  current  parameter  set  estimate  would  be  used 
to  update  the  controller. 

There  are  myriad  ways  to  select  the  reset  times  {f,}.  As  an  illustration,  let  the  reset 
times  be 

{ti)  =  {kN  :  parameter  identifiablity  on  [1  -f  (A:  —  1)A,  kN], 

control  design  exists,  (101) 

parameter  set  estimate  G  0o} 

Parameter  identifiability  is  essentially  an  information  test  on  the  data  and  is  equvalent  to 
the  notion  of  persistent  excitation  on  the  observation  interval  [f,-  —  N  +  l,t«].  If  there  is 
no  identifiablity,  then  we  keep  recording  the  data.  Even  with  good  data,  there  is  still  the 
possibility  of  being  unable  to  use  the  resulting  parameter  set  estimate  because  there  is  no 
’’numerical”  solution  for  the  control  design,  e.g.,  whenever  there  is  a  near  pole  and  zero 
cancellation  in  the  model  set  estimate.  Finally,  the  estimated  parameter  set  may  contain 
some  parameters  which  are  outside  the  set  of  prior  information  ©o-  If  this  occurs,  then 
the  ’’outliers”  can  be  projected  back  into  the  prior  set,  or  else  continue  to  record  data.  A 
further  discussion  of  projection  methods  and  resetting  schemes  are  dealt  with  in  Goodwin 
and  Sin(1984)  and  Bai  and  Sastry(1987). 

Solving  the  least-squares  parameter  estimation  problem  on  the  moving  interval  results 
in  the  estimate  at  t,-  given  by 

=  («>#’')■'  (102) 

where  (f>,  z  axe  as  previously  defined,  but  now  {■)■  denotes  averaging  on  the  moving  interval, 

he.,  ^ 

=  4  E  *(<)  (103) 

t=ti-N+l 

The  parameter  identifiability  test  is  to  require  that  the  smallest  eigenvalue  of  is 

larger  than  some  positive  number. 

Using  this  scheme,  the  adaptive  control  system  can  be  viewed  as  an  iterative  mapping 
in  parameter  space.  Specifically,  let  ©i  denote  the  ’’new”  parameter  set  estimate  at  the 
end  of  the  ith  reset  interval  where 

0,  {0  G  ©„  :  110  -  diW  <  (104) 

Hence,  corresponding  to  each  model  set  estimate  0,-,  there  is  a  parameter  estimate  0,  G  ©o 
and  a  radius  of  uncertainty  Si-  The  parameter  set  estimator  produces  the  sequence  of 
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’’new”  parameter  set  estimates 

©i,  i  =  1,2, . . .  (105) 

Correspondingly,  we  have  the  sequence  of  ’’current”  parameter  set  estimates 

0.  =  0i_i,  =  (106) 

where  ©o  is  an  initial  estimate. 

Using  the  above  relations  between  ’’new”  and  ’’current”  parameter  set  estimates,  we 
may  conceptually  express  the  adaptation  as  the  iterative  mapping: 

0.  =  r,{0._i},  i  =  l,2,...  (107) 

The  operator  F,-  {•}  is  a  time- varying  operator  mapping  parameter  sets  into  parameter 
sets  in  IR^.  In  words,  the  control  design  rule  generates  a  controller  based  on  the  ’’current” 
set  estimate  0,_i.  This  controller  is  fixed  during  the  ith  reset  interval  At  the 

end  of  this  interval,  the  ’’current”  estimate  0,_i  is  replaced  by  the  ’’new”  estimate  0,-, 
and  the  controller  is  redesigned.  Essentially,  the  ’’new”  estimate  is  a  function  of  the 
’’current”  (  and  now  past)  estimate  as  well  as  the  signals  present  in  the  closed-loop  system 
during  the  ith  reset  interval.  Thus,  the  operator  F,-  {■}  signifies  the  relation  between  the 
"current”  and  ’’new”  parameter  set  estimates.  A  further  discussion  and  anlysis  of  the 
properties  of  this  operator  can  be  found  in  Phillips,  Kosut,  and  FrankIin(19SS).  It  is 
further  revealed  there  that  fixed-  points  of  this  operator  are  in  fact  equilibriums  of  the 
averaged  system  corresponding  to  either  slow  adaptation  or  least-squares  based  parameter 
adaptive  algorithms.  What  we  have  essentially  shown  in  this  paper  is  that  the  fixed  points 
can  be  restricted  to  a  set  bounded  by  a  priori  knowledge  about  unmodeled  dynamics  and 
disturbances. 
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8  Concluding  Remarks 


We  have  presented  some  theoretical  results  and  simulation  experiments  which  give  credence 
to  the  use  of  a  parameter  set  estimator  in  an  adaptive  control  scheme  where  the  on-  line 
design  rule  is  prepared  to  accept  uncertainties  of  the  same  form.  The  main  result  shows 
that  with  a  sufficient  but  reasonable  amount  of  a  priori  information,  the  parameter  set 
estimator  can  always  capture  the  true  system.  To  make  the  range  of  uncertainty  as  small 
as  is  required  by  control  design  specifications  will  necessitate  an  appropriate  choice  of 
identification  experiment  design  variables,  such  els  data  filters,  and  input  spectrum. 
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A  Appendix 

A.l  Preliminaries 

We  first  state  some  preliminary  definitions  and  results. 


Definition:  The  iiscTeie- Fourier -irans form,  or  DFT,  of  a  sequence  {/}  ont  E  [0,  —  1] 

is  defined  as 

F;v(a;)  =  DFT{/}  =  ^  E 

where  fljv  a.re  the  DFT  frequencies 

The  DFT  admits  an  inverse  transform,  or  IDFT,  such  that 

fit)  =  lDFT{FN}=  E  tE[0,N-l] 

Outside  of  the  interval  [0,iV  —  1],  /(<)  ^  IDFT{Fjv}  in  general,  unless  /  is  iV-periodic. 
We  use  the  notation  /;v  to  denote  the  iV-  periodic  extension  of  /  which  is  defined  for  all 
t  E  (-00,00)  by 

Mt)  =  E 

Observe  that  for  t  E  [0,N  —  l],//v(t)  =  /(<)•  immediate  consequnce  of  the  above 
definitions  is 


Theorem  4  (Parseval’s  Theorem) 

4  E  =  E  ^;v(w)Fjv(u;)* 

t=0 

The  next  result  shows  how  DFT’s  are  tremsformed  by  stable  linear  systems.  The  result  as 
stated  is  a  slight  modification  of  Theorem  2.1  in  Ljung(1987),  and  for  this  reason  a  proof 
is  presented.  A  similar  result  can  also  be  found  in  LaMaire  et  al.  (1987). 

Lemma  2  (Linear  Transformation  of  DFT)  Let  {y,u  :  t  =  0, N  —  1}  denote  se¬ 
quences  with  DFT’s  for  u  E  Flff.  Suppose  that 

y  =  Gu 

where  G  is  a  stable  transfer  function  whose  pulse  response  sequence  {5^}  satisfies 

|^(<)|  <  Mp\  Vt  >  0 
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for  constants  M  >  0,/>  €  (0, 1).  Then: 

|y„(a,)  -  G(enu^(u,)\  <  i  |u(()  - 

where  {uA^(f)}  is  the  N -periodic  extension  of  u(t),  that  is, 

Mi)  =  E  i  e  (-00,00) 

(j£Ojv 


If  u{i)  is  N -periodic,  then 


Y;v(w)  =  Gie^‘^)UN{ij) 


Proof: 

For  i  6  [0,  iV  —  1],  we  have 


y(i)  =  E  gii-k)u{k) 

fc=-eo 

=  E  -  k)u(k)  +  E  ~  k)uN{k) 

Jfer:— 00  k=0 

=  E  g(i-k)u(k)-h  E  gii  -  k)uff{k)  -  E  git  -  k)uNik) 


-1 

E 

fc=-00 


k=-oo  fc=-oo 

=  E  ^(<  -  fc)«iv(A;)  +  E  5'(*  -  *)l«(^)  - 

k=—oo  fc=— 00 

From  the  definition  of  the  DFT, 

Fn(u;)  =  G{enUNi<^)  +  RnH 


where 


Thus, 


1 


N-l 


-1 


RnH  =  ^  E  1  E  ?(<  -  ^)IM)  -  Mk)] 

t=o 


^*=-00  / 

|-RAr(w)|  <  ^  sup  |u(i)  -  Uiv(0l  E  E  “  ^)l 

^  ie(-oo,-l]  t=0  k=-oo 

<  i  sup  |u(<)  -  ^ 

t=0  k=-oo 


-t-fc 


te(-oo,-i] 


=  TF  sup  |u(t)  -  OAr(0|M 


ff /)  E 

\  t=0  /  \fe=:-00 

=  4  sup  i«(o-«iv{Oi^ 

iv  *€(-00,-11  \l-pf\l-pj 


This  completes  the  proof. 

From  the  above  Lemma  we  obteiin: 
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Corllary  2  Let  {y,u  :  t  G  [0, iV  —  1]}  have  respective  DFT’s  {YniUn  '■  ^  G  Let  G 

be  a  stable  transfer  function  whose  pulse  response  sequence  {g}  satisfies 

1^(01  <  Mp\  Vt  >  0 

for  constants  M  >  0,/>  €  (0, 1).  Then: 

(y(Gu))„  =  Y, 


where 


le^l  <  sup  \u(t)\  sup  KO -«iv(01  (V"^'  1 

with  UAr(i)  the  N -periodic  extension  of  u{t). 


A. 2  Proof  of  Theorem  1 

We  staxt  with  the  expression  for  parameter  error  from  Lemma  1,  i.e., 

in  =  +  (^(WfAb«))„ 

where  we  have  defined  ^ 

xj)  =  <t> 

Applying  the  previous  lemma  and  corollary  to  each  term  gives: 

^N  =  -  H  [>MenAA(e-'")ryv(u;)]*M‘^)+  E 

where 

ii««li  <  I 

with  /3  as  defined  in  the  Theorem.  Under  the  assumptions  of  the  theorem,  namely  the 
known  frequency  bounds  on  AB(e^")  and  AA(e^"),  the  indicated  bound  (37)  follows. 
Ovserve  also  that 

sup{||  E  |WF(e^)AA(e'")r«(u,)|-««M||  :  |A*(e'")|  <  |WA(e'")|}  =  ||7*|| 

uj^On 

Thus,  the  bound  (37)  is  asymptotically  (in  N)  tight.  This  completes  the  proof. 
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